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ANTIBACTERIAL TARGETS IN ALLOIOCOCCUS OTITIDIS 

Field of the invention 

The present invention relates to the genomic sequence of Alloiococcus otitidis 
5 and polynucleotide sequences encoding polypeptides of the Gram-positive 

bacterium, Alloiococcus otitidis. The invention also relates to polynucleotides and 
polynucleotides encoding polypeptides, preferably antigenic polypeptides, encoded 
by the Alloiococcus otitidis open reading frames and the uses thereof. 

10 Background of the invention 

Since the discovery of penicillin, the use of antibiotics to treat the ravages of 
bacterial infections has saved millions of lives. With the advent of these "miracle 
drugs," for a time it was popularly believed that humanity might, once and for all, be 

15 saved from the scourge of bacteria! infections. In fact, during the 1 980s and early 
1990s, many large pharmaceutical companies cut back or eliminated antibiotics 
research and development. They believed that infectious disease caused by bacteria 
finally had been conquered and that markets for new drugs were limited. 
Unfortunately, this belief was overly optimistic. The tide is beginning to turn in favor of 

20 the bacteria, as reports of drug resistant bacteria become more frequent. The United 
States Centers for Disease Control and Prevention announced that one of the most 
powerful known antibiotics, vancomycin, was unable to treat an infection of the 
common bacterial pathogen, Staphylococcus aureus. This organism, commonly 
found in our environment, is responsible for many nosocomial infections. The import 

25 of this announcement becomes clear when one considers that vancomycin was used 
for years to treat infections caused by Staphylococcus species as well as other 
stubborn strains of bacteria. In short, bacteria are becoming resistant to our most 
powerful antibiotics. If this trend continues, it is conceivable that we will return to a 
time when what are presently considered minor bacterial infections are fatal 

30 diseases. 

Over-prescription and improper prescription habits by some physicians have 
caused an indiscriminate increase in the availability of antibiotics to the public. The 
patients are also partly responsible, since they will often improperly use the drug, 
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thereby generating yet another population of bacteria that is resistant, in whole or in 

part, to traditional antibiotics. 

The bacterial pathogens that have haunted humanity remain, in spite of the 

development of modern scientific practices to deal with the diseases that they cause. 
5 Drug resistant bacteria are now an increasing threat to the health of humanity. A new 

generation of antibiotics is needed to once again deal with the pending health threats 

that bacteria present. 

As more and more bacterial strains become resistant to the panel of available 

antibiotics, new antibiotics are required to treat infections. In the past, practitioners of 
10 pharmacology relied. upon traditional methods of drug discovery to generate novel, 

safe and efficacious compounds for the treatment of disease. Traditional drug 

discovery methods involve blindly testing potential drug candidate- molecules, often 

selected at random, in the hope that one might prove to be an effective treatment for 

some disease. The process is painstaking and laborious, with no guarantee of 
15 success. 

Newly emerging practices in drug discovery utilize a number of biochemical 
techniques to provide for directed approaches to creating new drugs, rather than 
discovering them at random. For example, gene sequences and proteins encoded 
thereby that are required for the proliferation of a cell or microorganism make 

20 excellent targets since exposure of bacteria to compounds active against these 

targets would result in the inactivation of the cell or microorganism. Once a target is 
identified, biochemical analysis of that target can be used to discover or to design 
molecules that interact with and alter the functions of the target. Use of physical and 
computational techniques to analyze structural and biochemical properties of targets 

25 in order to derive compounds that interact with such targets is called rational drug 
design and offers great potential. Thus, emerging drug discovery practices use 
molecular modeling techniques, combinatorial chemistry approaches, and other 
means to produce and screen and/or design large numbers of candidate compounds. 
Nevertheless, while this approach to drug discovery is clearly the way of the 

30 future, problems remain. For example, the initial step of identifying molecular targets 
for investigation can be an extremely time consuming task. It may also be difficult to 
design molecules that interact with the target by using computer modeling 
techniques. Furthermore, in cases where the function of the target is not known or is 
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poorly understood, it may be difficult to design assays to detect molecules that 
interact with and alter the functions of the target. To improve the rate of novel drug 
discovery and development, methods of identifying important molecular targets in 
pathogenic cells or microorganisms and methods for identifying molecules that 

5 interact with and alter the functions of such molecular targets are urgently required. 

The present invention is directed to identifying important molecular targets in 
a recently identified bacteria, Alloiococcus otitidis, which has been implicated in otitis 
media with effusion (OME). Otitis media, an inflammatory disease of the middle ear, 
is the most frequent cause of visits to pediatricians' offices in the United States 

10 (Schappert, 1991). Approximately 80% of all children experience at least one episode 
of otitis media by the age of three (Klein, 1994). There are three main types of otitis 
media: Acute otitis media (AOM), otorrhea, and otitis media with effusion (OME). 
Alloiococcus otitidis has only been associated with otitis media with effusion (OME), 
but this may be due to the difficulty of its detection by standard bacterial culturing 

15 methods. Its detection in the effusions is likely due to the fact that the effusions are 
normally sterile and few or no competing bacterial species are isolated from them. 
Without the interference of faster growing nasophryngeal species, the culture plates 
can be incubated for the longer duration needed to detect Alloiococcus otitidis 
colonies. 

20 Three other bacterial species are commonly isolated from middle ear 

effusions. These are nontypeable Haemophilus influenzae, Moraxella catarrhalis, and 
Streptococcus pneumoniae. One or more of these species have been found in one 
study to be associated with about 77% of all cases of OME using a PCR detection 
method (Post, 2000). This study did not include assaying for Alloiococcus otitidis, so 

25 a portion of the unaccounted cases may be due to this organism. 

The bacterium Alloiococcus otitidis was first isolated from the middle ear 
fluids of 10 children in the Buffalo, NY area with persistent OME and characterized as 
a large catalase negative, Gram-positive cocci that tend to occur in clumps, often in 
tetrads. It is slow growing and requires 2 to 5 days at 37°C before colonies can be 

30 seen on sheep blood agar plates. The bacterium was named Alloiococcus otitis by 
Aguirre and Collins (1992), who showed that it was different from other known Gram- 
positive species based on its 1 6S rRNA sequence. The bacterium's name has been 
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changed from Alloiococcus otitis to Alloiococcus otitidis. (Hendolin, et a!., (1999), and 
Hendoiin et al., (2000)). 

Several studies of the epidemiology Alioiococcus otitidis indicate it is 
associated with otitis media with effusion. These are summarized in Table 1 . These 
studies have been done using both culture and PCR techniques. The number of 
cases detected by culture, as might be expected from the fastidious growth 
requirements of the bacterium, was less than the number detected by PCR. 
Assuming that the bacterium is detected more accurately by the PCR method, the 
bacterium is detected in between 10 and 50% of patients with OME. This frequency 
suggests that this organism represents a significant public health problem. 
Consequently, there is a need for identifying gene targets in Alloiococcus otitidis for 
the development of anti-infectives. There is also a need for compositions for 
diagnosing Alloiococcus otitidis infection. 



15 



TABLE 1 1 SUMMARY OF STUDIES INDICATING AN ASSOCIATION OF ALLOIOCOCCUS 



% 

detected 


N a 


Method 


Reference 


8 


200 


Culture 


Faden & Dryja, J. Clin. Microbiol. 27:2488 (1989) 


3 


100 


Culture 


Sih etal., ICAAC (1992) 


20 


25 


PCR 


Hendolin et al., J. Clin. Microbiol. 35:2854 (1997) 


50 


12 


PCR 


Beswick, et al., Lancet 345:386 (1999) 


42 


67 


PCR 


Hendolin, et al., Pediatr. Infect. Dis. J. 18:860 (1999) 


10 


49 


PCR 


Hendolin et al., J. Clin. Microbiol. 38:125 (2000) 
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SUMMARY OF INVENTION 

The present invention broadly relates to AHoiococcus otitidis genomic 
sequence. Particularly, the invention relates to newiy identified polynucleotide open 
5 reading frames (ORFs) comprised within the genomic nucleotide sequence of 
AHoiococcus otitidis, and to polypeptides encoded by the ORFs. More particularly, 
the ORFs encode polypeptides that are essential for the growth and survivablity of 
AHoiococcus otitidis. 

Thus, in certain aspects, the invention relates to AHoiococcus otitidis ORFs 

10 that encode AHoiococcus otitidis polypeptides that function as enzymes in various 
biosynthetic pathways in the bacterium. In one embodiment, the invention relates to 
a purified or isolated AHoiococcus otitidis nucleic acid sequence comprising a 
nucleotide sequence selected from one of odd numbered sequences set forth in Seq. 
ID Nos: 1 to Seq. ID Nos: 105, wherein expression of said nucleic acid is essential for 

15 the proliferation of a cell. In a preferred embodiment the ORF selected from one of 
the odd numbered sequence listings set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105 
encodes an essential gene. The essential gene and the polypeptide encoded by 
them include ACPS (holo-(acyl carrier protein) synthase), murF (UDP-N- 
acetylmuramoylalanyi-D-glutamyl-2,6-diamino pimelate-D-alanyl-D-alanyl ligase) 

20 murA-2 (UDP-N-acetylglucosamine 1-carboxyvinyltransf erase), RpoE (DNA-directed 
RNA polymerase, delta subunit), rpoA (DNA-directed RNA polymerase alpha 
subunit), rpoC (RNA polymerase beta' subunit), rpoB (DNA-dependent RNA 
polymerase subunit beta), dnaB/C (DNA polymerase III delta prime subunit), gyrA 
(DNA gyrase A subunit), gyrB (DNA gyrase B subunit), dnaN (DNA polymerase 111 

25 beta chain, folC-2 (folyl-polyglutamate synthetase), murE (UDP-N-acetylmuramoyl-L- 
alanyl-D-glutamyl-L-lysine Ligase), srtA (sortase), folC-1 (folyl-polyglutamate 
synthetase), folB (dihydroneopterin aldolase), folK (7,8-dihydro-6- 
hydroxymethylpterin-pyrophosphokinase), mvaS (hydroxymethylglutary!-CoA 
synthase), mvaA (3-hydroxy-3-methylglutaryl-coenzyme a reductase), murB (UDP-N- 

30 acetylglucosaminyl-3-enolpyruvate reductase), mvaK2 (phosphomevalonate kinase), 
mvaD (mevalonate diphosphate decarboxylase), mvaK1 (mevalonate kinase), coaA 
(pantothenate kinase), nadE (NAD+ synthase), muri, Glutamate racemase), folP 
(Dihydropteroate synthase), folA (dihydrofolate reductase), grIB (topoisomerase IV B 
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subunit), grIA (topoisomerase IV A subunit), rpoD (transcription initiation factor 
sigma), dnaG (DNA primase), era (GTP-binding protein), norA (drug-export protein), 
polC (DNA polymerase III, alpha subunit), obg (GTP-binding protein), yphC (similar 
to Escherichia coli GTP-binding protein Era), dnaE (DNA polymerase III, alpha 
5 subunit), coaBC (phosphopantothenoylcysteine synthetase/decarboxylase), holA 
(DNA polymerase III delta subunit), coaD (phosphopantetheine adenyly transferase) 
ftsZ (Cell division protein ftsZ), ftsA (Cell division protein ftsA), murG (phospho-N- 
acetylmuramoyl-pentapeptide-transferase), murD (UDP-N-acetylmuramoyialanine D- 
glutamate ligase), nadD (nicotinic acid mononucleotide adenylyltransferase), coaE 
10 (dephospho-CoA kinase), murC (UDP-N-acetyl muramate-alanine ligase), fmhB 
FemX (factor essential for methiciilin resistance), pcrA (ATP-dependent DNA 
helicase), murA-1 (UDP-N-acetylglucosamine 1-carboxyvinyltransferase), holB (DNA 
polymerase III delta' subunit) and dnaX (DNA polymerase III -gamma and tau 
subunits). 

15 In another embodiment, the invention relates to purified or isolated nucleic 

acid of Alloiococcus otitidis comprising a fragment of one of odd numbered 
sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, wherein said fragment is 
selected from the group consisting of fragments comprising at least 10, at least 20, at 
least 25, at least 30, at least 50 and more than 50 consecutive nucleotides of one of 

20 one of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 

In yet another embodiment, the invention relates to a purified or isolated 
antisense nucleic acid comprising a nucleotide sequence complementary to at least a 
portion of an intragenic sequence, intergenic sequence, sequences spanning at least 
a portion of two or more genes, 5' noncoding region, or 3' noneoding region within an 

25 operon comprising a proliferation-required gene of Alloiococcus otitidis whose activity 
or expression is inhibited by an antisense nucleic acid and selected from one of odd 
numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 

In a nother embodiment, the invention relates to a purified or isolated nucleic 
acid comprising a nucleotide sequence having at least 70% identity to a nucleotide 

30 sequence selected from one of odd numbered sequences set forth in Seq. ID Nos: 1 
to Seq. ID Nos: 105, fragments comprising at least 25 consecutive nucleotides 
selected from one of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID 
Nos: 105, the nucleotide sequences complementary to one of odd numbered 
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sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, and the sequences 
complementary to fragments comprising at least 25 consecutive nucleotides of one of 
odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 

In another embodiment, the invention relates to a vector comprising a 

5 promoter operably linked to a nucleic acid encoding a polypeptide whose expression 
is inhibited by an antisense nucleic acid comprising a nucleotide sequence of any 
one of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 

In another embodiment, the invention relates to purified or isolated 
polypeptide of Alloiococcus otitidis comprising a polypeptide whose expression is 

10 inhibited by an antisense nucleic acid comprising a nucleotide sequence of one of 
odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, or a 
fragment selected from the group consisting of fragments comprising at least 5, at 
least 10, at least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 
consecutive amino acids of one of the said polypeptides. 

15 In yet another embodiment, the invention relates to purified or isolated 

Alloiococcus otitidis polypeptide comprising a amino acid sequence having at least 
25% amino acid identity to a polypeptide whose expression is inhibited by a nucleic 
acid comprising a nucleotide sequence selected from one of odd numbered 
sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, or at least 25% amino 

20 acid identity to a fragment comprising at least 10, at least 20, at least 30, at least 40, 
at least 50, at least 60 or more than 60 consecutive amino acids of a polypeptide 
whose expression is inhibited by a nucleic acid comprising a nucleotide sequence 
selected from the group consisting of one of odd numbered sequences set forth in 
Seq. ID Nos: 1 to Seq. ID Nos: 105. 

25 In one embodiment, the invention relates to a purified or isolated Alloiococcus 

otitidis polypeptide comprising selected from one of the even numbered sequences 
set forth in Seq. ID Nos: 2 to Seq. ID Nos: 106, wherein the polypeptide is essential 
for the proliferation of a cell.. 

In yet another embodiment, the invention relates to a method of producing an 

30 Alloiococcus otitidis polypeptide comprising introducing into a cell a vector 

comprising a promoter operably linked to a nucleic acid comprising a nucleotide 
sequence encoding a polypeptide whose expression is essential for the proliferation 
and viability of Alloiococcus otitidis, and which is inhibited by an antisense nucleic 
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acid, and which is selected from one of odd numbered sequences set forth in Seq. ID 
Nos: 1 to Seq. ID Nos: 105. 

In yet another embodiment, the invention relates to a method of inhibiting the 
proliferation of Alloiococcus otitidis in an individual comprising inhibiting the activity or 

5 reducing the amount of a gene product whose expression is inhibited by an antisense 
nucleic acid comprising a nucleotide sequence selected from one of odd numbered 
sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105 or inhibiting the activity or 
reducing the amount of a nucleic acid encoding said gene product. 

In a preferred embodiment, the invention relates to method for identifying a 

10 compound which influences the activity of an Alloiococcus otitidis gene product , 
which is required for proliferation, said gene product comprising a gene product 
whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from one of odd numbered sequences set forth in Seq. ID Nos: 1 
to Seq. ID Nos: 105, said method comprising: (a) contacting said gene product with a 

15 candidate compound; and (b) determining whether said compound influences the 
activity of said gene product. 

In a preferred embodiment, the invention relates to method for identifying a 
compound or an antisense nucleic acid having the ability to reduce activity or level of 
a Alloiococcus otitidis gene product, which is required for proliferation, said gene 

20 product comprising a gene product whose activity or expression is inhibited by an 
antisense nucleic acid comprising a nucleotide sequence selected from one of odd 
numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, said method 
comprising the steps of: (a) contacting a target gene or RNA encoding said gene 
product with a candidate compound or antisense nucleic acid; and(b) measuring the 

25 activity of said target. 

In yet another preferred embodiment, the invention relates to method for 
inhibiting cellular proliferation of Alloiococcus otitidis comprising introducing an 
effective amount of a compound with activity against a gene whose activity or 
expression is essential for cellular proliferation, and which is inhibited by an 

30 antisense nucleic acid comprising a nucleotide sequence selected from one of odd 
numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, or a compound 
with activity against the product of said gene into a population of Alloiococcus otitidis 
cells expressing said gene. 
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In a preferred embodiment, the invention relates to a composition comprising 
an effective concentration of an antisense nucleic acid comprising a nucleotide 
sequence selected from one of odd numbered sequences set forth in Seq. ID Nos: 1 
5 to Seq. ID Nos: 105, or a proliferation-inhibiting portion thereof in a pharmaceutical^ 
acceptable carrier. 

. In a preferred embodiment, the invention relates to method for identifying a 
compound having the ability to inhibit proliferation of Alloiococcus otitidis cell 
comprising: (a) identifying a homologue of a gene or gene product whose activity or 

10 level is inhibited by a nucleic acid comprising a nucleotide sequence selected from 
one of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, in a 
test cell, wherein said test cell is not Alloiococcus otitidis; (b) identifying an inhibitory 
nucleic acid sequence which inhibits the activity of said homologue in said test cell; 
(c) contacting said test cell with a sublethal level of said inhibitory nucleic acid, thus 

15 sensitizing said cell; (d) contacting the sensitized cell of step (c) with a compound; 
and (e) determining the degree to which said compound inhibits proliferation of said 
sensitized cell relative to a cell which does not contain said inhibitory nucleic acid. 

In a preferred embodiment, the invention relates to a method for identifying a 
compound having activity against a biological pathway required for proliferation 

20 comprising: (a) sensitizing a cell by providing a sublethal level of an antisense nucleic 
acid complementary to a nucleic acid encoding a gene product required for 
proliferation, wherein the activity or expression of said gene product is inhibited by an 
antisense nucleic acid comprising a nucleotide sequence selected from one of odd 
numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, in said cell to 

25 reduce the activity or amount of said gene product; (b) contacting the sensitized cell 
with a compound; and (c) determining the degree to which said compound inhibits 
the growth of said sensitized cell relative to a cell which does not contain said 
antisense nucleic acid. 

In a preferred embodiment, the invention relates to a method for identifying a 

30 compound having the ability to inhibit one of the Alloiococcus otitidis polypeptides 

encoded by a polynucleotide selected from one of odd numbered sequences set forth 
in Seq. ID Nos: 1 to Seq. ID Nos: 105, and which is essential for cellular proliferation 
comprising: (a) contacting a cell which expresses the polypeptide with the compound; 
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and (b) determining whether said compound reduces proliferation of said contacted 
cell by acting on said gene product 

In a preferred embodiment, the invention relates to a method for identifying a 
compound having the ability to inhibit one of the purified and isolated Alloiococcus 

5 otitidis polypeptides selected from one of the even numbered sequences set forth in 
Seq. ID No.: 2 to Seq. ID No.: 106, and which is essential for cellular proliferation 
comprising: (a) contacting the purified and isolated polypeptide with the compound in 
vitro in the presence or absence of a substrate, which is essential for the activity of 
the polypeptide; and (b) dete.rmining the effect of the compound on the polypeptide 

10 by measuring the effect of the polypeptide on the substrate. 

In a preferred embodiment, the invention relates to a compound which 
interacts with an Alloiococcus otitidis polypeptide selected from one of the even 
numbered sequences set forth in Seq. ID No.: 2 to Seq. ID No.: 106 and inhibits its 
activity. 

15 in a preferred embodiment, the invention relates to a method for 

manufacturing an antimicrobial compound comprising the steps of screening one or 
more candidate compounds to identify a compound that reduces the activity or level 
of an Alloiococcus otitidis polypeptide selected from one of the even numbered 
sequences set forth in Seq. ID No.: 2 to Seq. ID No.: 106, said polypeptide 

20 comprising a gene product whose activity or expression is inhibited by an antisense 
nucleic acid comprising a nucleotide sequence selected from one of the odd 
numbered sequences set forth in Seq. ID No.: 1 to Seq. ID No. 105; and 
manufacturing the compound so identified. 

In a preferred embodiment, the invention relates to a compound which inhibits 

25 proliferation of Alloiococcus otitidis by interacting with a gene encoding a polypeptide 
that is required for proliferation or with a polypeptide required for proliferation, 
wherein said polypeptide is selected from the group consisting of a gene product 
having at least 70% nucleotide sequence identity from one of the odd numbered 
sequences set forth in Seq. ID No.: 1 to Seq. ID No. 105, polypeptide encoded by a 

30 nucleic acid having at least 70% nucleotide sequence identity to a nucleic acid 

encoding a polypeptide whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from one of the odd numbered 
sequences set forth in Seq. ID No.: 1 to Seq. ID No. 105, a polypeptide having at 
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least 25% amino acid identity to a gene product whose expression is inhibited by an 
antisense nucleic acid comprising a nucleotide sequence selected one of the odd 
numbered sequences set forth in Seq. ID No.: 1 to Seq. ID No. 105, a polypeptide 
encoded by a nucleic acid comprising a nucleotide sequence which hybridizes to a 

5 nucleic acid selected from one of the odd numbered sequences set forth in Seq. ID 
No.: 1 to Seq. ID No. 105 under stringent conditions, a gene product encoded by a 
nucleic acid comprising a nucleotide sequence which hybridizes to a nucleic acid 
selected from one of the odd numbered sequences set forth in Seq. ID No.: 1 to Seq. 
ID No. 105 under moderate conditions, and a gene product whose activity may be 

10 complemented by the gene product whose activity is inhibited by a nucleic acid 

selected from one of the odd numbered sequences set forth in Seq. ID No.: 1 to Seq. 
ID No. 105. 



DETAILED DESCRIPTION OF THE INVENTION 

15 

A. Definitions: 

By "biological pathway" is meant any discrete cell function or process that is 
carried out by a gene product or a subset of gene products. Biological pathways 
include anabolic, catabolic, enzymatic, biochemical and metabolic pathways as well 

20 as pathways involved in the production of cellular structures such as cell walls. 
Biological pathways that are usually required for proliferation of cells or 
microorganisms include, but are not limited to, cell division, DNA synthesis and 
replication, RNA synthesis (transcription), protein synthesis (translation), protein 
processing, protein transport, fatty acid biosynthesis, electron transport chains, cell 

25 wall synthesis, cell membrane production, synthesis and maintenance, and the like. 

By "inhibit activity of a gene or gene product" is meant having the ability to 
interfere with the function of a gene or gene product in such a way as to decrease 
expression of the gene, in such a way as to reduce the level or activity of a product of 
the gene or in such a way as to inhibit the interaction of the gene or gene product 

30 with other biological molecules required for its activity. 

Agents which inhibit the activity of a gene include agents that inhibit 
transcription of the gene, agents that inhibit processing of the transcript of the gene, 
agents that reduce the stability of the transcript of the gene, and agents that inhibit 
translation of the mRNA transcribed from the gene. In microorganisms, agents which 
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inhibit the activity of a gene can act to decrease expression of the operon in which 
the gene resides or alter the folding or processing of operon RNA so as to reduce the 
level or activity of the gene product The gene product can be a non- translated RNA 
such as ribosomal RNA, a translated RNA (mRNA) or the protein product resulting 
5 from translation of the gene mRNA. Of particular utility to the present invention are 
antisense RNAs that have activities against the operons or genes to which they 

specifically hybridze. 

By "activity against a gene product" is meant having the ability to inhibit the 

function or to reduce the level or activity of the gene product in a cell. This includes, 
10 but is not limited to, inhibiting the enzymatic activity of the gene product or the ability 

of the gene product to interact with other biological molecules required for its activity, 

including inhibiting the gene product's assembly into a multimeric structure. 

By "activity against a protein" is meant having the ability to inhibit the function 

or to reduce the level or activity of the protein in a cell. This includes, but is not 
15 limited to, inhibiting the enzymatic activity of the protein or the ability of the protein to 

interact with other biological molecules required for its activity, including inhibiting the 

protein's assembly into a multimeric structure. 

By "activity against a nucleic acid" is meant having the ability to inhibit the 

function or to reduce the level or activity of the nucleic acid in a cell. This includes, 
20 but is not limited to, inhibiting the ability of the nucleic acid interact with other 

biological molecules required for its activity, including inhibiting the nucleic acid's 

assembly into a multimeric structure. 

By "activity against a gene" is meant having the ability to inhibit the function or 

expression of the gene in a cell. This includes, but is not limited to, inhibiting the 
25 ability of the gene to interact with other biological molecules required for its activity. 

By "activity against an operon" is meant having the ability to inhibit the function or 

reduce the level of one or morie products of the operon in a cell. This includes, but is 

not limited to, inhibiting the enzymatic activity of one or more products of the operon 

or the ability of one or more products of the operon to interact with other biological 
30 molecules required for its activity. 

By "antibiotic" is meant an agent which inhibits the proliferation of a cell or 

microorganism. 
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By "homologous coding nucleic acid" is meant a nucleic acid homologous to a 
nucleic acid encoding a gene product whose activity or level is inhibited by a nucleic 
acid selected from the group consisting of Seq ID Nos.: 1 to Seq. ID Nos.: 105 or a 
portion thereof. In some embodiments, the homologous coding nucleic acid may 

5 have at least 97%, at least 95%, at least 90%, at least 85%, at least 80%, or at least 
70% nucleotide sequence identity to a nucleotide sequence selected from the group 
consisting of Seq ID Nos.: 1 to Seq. ID Nos.: 105 and fragments comprising at least 
10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive 
nucleotides thereof. In other embodiments the homologous coding nucleic acids may 

10 have at least 97%, at least 5 95%, at least 90%, at least 85%, at least 80%, or at 
least 70% nucleotide sequence identity to a nucleotide sequence selected from the 
group consisting of the nucleotide sequences complementary to one of Seq ID Nos.: 
1 to Seq. ID Nos.: 105 and fragments comprising at least 10, 15, 20, 25, 30, 35, 40, 
50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides thereof. Identity may 

15 be measured using BLASTN version 2.0 with the default parameters or tBLASTX 
with the default parameters. (Altschul, S.F. et al. Gapped BLAST and PSI-BLAST: A 
New Generation of Protein Database Search Programs, Nucleic Acid Res. 25: 3389- 
3402 (1997)) Alternatively a u homologuous coding nucleic acid" could be identified by 
membership of the gene of interest to a functional orthoiogue cluster. All other 

20 members of that orthoiogue cluster would be considered homologues. Such a library 
of functional orthoiogue clusters can be found at hltp://www.nebi.nlm. nib.gov/COG. A 
gene can be classified into a cluster of orthologous groups or COG by using the 
COGNITOR program available at the above web site, or by direct BLASTP 
comparison of the gene of interest to the members of the COGs and analysis of 

25 these results as described by Tatusov, R.L., Galperin, M.Y., Natale, D. A. and 

Koonin, E.V. (2000) The COG database: a tool for genome- scale analysis of protein 
functions and evolution. Nucleic Acids Research v. 2 8 n. 1 , pp3 3 -3 6. 

The term "homologous coding nucleic acid" also includes nucleic acids 
comprising nucleotide sequences which encode polypeptides having at least 99%, 

30 95%, at least 90%, at least 85%, at least 80%, at least 70%, at least 60%, at least 
50%, at least 40% or at least 25% amino acid identity or similarity to a polypeptide 
comprising the amino acid sequence of one of Seq ID Nos.: 1 to Seq. ID Nos.: 105 or 
to a polypeptide whose expression is inhibited by a nucleic acid comprising a 
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nucleotide sequence of one of Seq ID Nos.: 1 to Seq. ID Nos.: 105 or fragments 
comprising at least 5, 10, 15, 20, 25, 30,35, 40, 50, 75, 100, or 150 consecutive 
amino acids thereof as determined using the FASTA version 3.0t78 algorithm with 
the default parameters. Alternatively, protein identity or similarity may be identified 

5 using BLASTP with the default parameters, BLASTX with the default parameters, 
TBLASTN with the default parameters, or tBLASTX with the default parameters. 
(Altschul, S.F. et al. Gapped BLAST and PSI-BLAST: A New Generation of Protein 
Database Search Programs, Nucleic Acid Res. 25: 3389-3402 (1997)). 

The term "homologous coding nucleic acid" also includes coding nucleic acids 

10 which hybridize under stringent conditions to a nucleic acid selected from the group 
consisting of the nucleotide sequences complementary to one of Seq ID Nos.: 1 to 
Seq. ID Nos.: 105 and coding nucleic acids comprising nucleotide sequences which 
hybridize under stringent conditions to a fragment comprising at least 10, 15, 20, 25, 
30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides of the 

15 sequences complementary to one of Seq ID Nos.: 1 to Seq. ID Nos.: 105. 

As used herein, "stringent conditions" means hybridization to filter-bound 
nucleic acid in 6xSSC at about 45'C followed by one or more washes in 0. lxSSC/0.2/ 
SDS at about 680C. Other exemplary stringent conditions may refer, e.g., to washing 
in 6xSSC/0.05% sodium pyrophosphate at 37C, 48'C, 55'C, and 60'C as appropriate 

20 for the 5 particular probe being used. 

The term "homologous coding nucleic acid" also includes coding nucleic acids 
comprising nucleotide sequences which hybridize under moderate conditions to a 
nucleotide sequence selected from the group consisting of the sequences 
complementary to one of Seq ID Nos.: 1 to Seq. ID Nos.: 105 and coding nucleic 

25 acids comprising nucleotide sequences which hybridize under moderate conditions to 
a fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 
150,200,300,400, or 500 consecutive nucleotides of the sequences complementary 
to one of Seq ID Nos.: 1 to Seq. ID Nos.: 105. As used herein, "moderate conditions" 
means hybridization to filter-bound DNA in 6x sodium chloride/sodium citrate (SSC) 

30 at about 45'C followed by one or more washes in 0.2xSSC/0. 1 % SDS at about 42- 
65'C. 

The term "homologous coding nucleic acids" also includes nucleic acids 
comprising nucleotide sequences which encode a gene product whose activity may 
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be complemented by a gene encoding a gene product whose activity is inhibited by a 
nucleic acid comprising a nucleotide sequence selected from the group consisting of 
Seq ID Nos.: 1 to Seq. ID Nos.: 105. In some embodiments, the homologous coding 
nucleic acids may encode a gene product whose activity is complemented by the 

5 gene product encoded by a nucleic acid comprising a nucleotide sequence selected 
from the group consisting Seq ID Nos.: 1 to Seq. ID Nos.: 105. In other 
embodiments, the homologous coding nucleic acids may comprise a nucleotide 
sequence encodes a gene product whose activity is complemented by one of the 
polypeptides of Seq ID Nos.: 1 to Seq. ID Nos.: 105 . 

10 The term "homologous antisense nucleic acid" includes nucleic acids 

comprising a nucleotide sequence having at least 97%, at least 95%, at least 90%, at 
least 85%, at least 80%, or at least 70% nucleotide sequence identity to a nucleotide 
sequence selected from the group consisting of one of the sequences of Seq ID 
Nos.: 1 to Seq. ID Nos.: 105 and fragments comprising at least 10, 15, 20, 25, 

15 30,35,40, 50, 75, 1 00, 150, 200,300,400, or 500 consecutive nucleotides thereof. 
Homologous antisense nucleic acids may also comprising nucleotide sequences 
which have at least 97%, at least 95%, at least 90%, at least 85%, at least 80%, or at 
least 70% nucleotide sequence identity to a nucleotide sequence selected from the 
group consisting of the sequences complementary to one of sequences of Seq ID 

20 Nos.: 1 to Seq. ID Nos.: 105 and fragments comprising at least 10, 15, 20, 25, 30, 35, 
40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides thereof. 

Nucleic acid identity may be determined as described above. 
The term "homologous antisense nucleic acid" also includes antisense nucleic acids 
comprising. nucleotide sequences which hybridize under stringent conditions to a 

25 nucleotide sequence complementary to one of Seq ID Nos.: 1 to Seq. ID Nos.: 105 
and antisense nucleic acids comprising nucleotide sequences which hybridize under 
stringent conditions to a fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 
75, 100, 150,200, 300, 400, or 500 consecutive nucleotides of the sequence 
complementary to one Seq ID Nos.: 1 to Seq. ID Nos.: 105. Homologous antisense 

30 nucleic acids also include antisense nucleic acids comprising nucleotide sequences 
which hybridize under stringent conditions to a nucleotide sequence selected from 
the group consisting of Seq ID Nos.: 1 to Seq. ID Nos.: 105, and antisense nucleic 
acids comprising nucleotide sequences which hybridize under stringent conditions to 
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a fragment comprising at least 10, 15, 20,25, 30, 35, 40, 50, 75, 
100,150,200,300,400, or 500 consecutive nucleotides of one of Seq ID Nos.: 1 to 

Seq. ID Nos.: 105. 

The term "homologous antisense nucleic acid" also includes antisense 

5 nucleic acids comprising nucleotide sequences which hybridize under moderate 

conditions to a nucleotide sequence complementary to one of Seq ID Nos.: 1 to Seq. 
ID Nos.: 105 and antisense nucleic acids comprising nucleotide sequences which 
hybridize under moderate conditions to a fragment comprising at least 10, 15, 20, 25, 
30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides of the 

10 sequence complementary to one of Seq ID Nos.: 1 to Seq. ID Nos.: 105. 

Homologous antisense nucleic acids also include antisense nucleic acids comprising 
nucleotide sequences which hybridize under moderate conditions to a nucleotide 
sequence selected from the group consisting of Seq ID Nos.: 1 to Seq. ID Nos.: 105 
and antisense nucleic acids which comprising nucleotide sequences hybridize under 

15 moderate conditions to a fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 
75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides of one of Seq ID Nos.: 1 

to Seq. ID Nos.: 105. 

By "homologous polypeptide" is meant a polypeptide homologous to a 
polypeptide whose activity or level is inhibited by a nucleic acid comprising a 

20 nucleotide sequence selected from the group consisting of Seq ID Nos.: 1 to Seq. ID 
Nos.: 105 by a homologous antisense nucleic acid. The term "homologous 
polypeptide" includes polypeptides having at least 99%, 95%, at least 90%, at least 
85%, at least 80%, at least 70%, at least 60%, at least 50%, at least 40% or at least 
25% amino acid identity or similarity to a polypeptide whose activity or level is 

25 inhibited by a nucleic acid selected from the group consisting of Seq ID Nos.: 1 to 

Seq. ID Nos.: 105 or by a homologous antisense nucleic acid, or polypeptides having 
at least 99%, 95%, at least 90%, at least 85%, at least 80%, at least 70%, at least 
60%, at least 50%, at least 40% or at least 25% amino acid identity or similarity to a 
polypeptide to a fragment comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 

30 100, or 150 consecutive amino acids of a polypeptide whose activity or level is 
inhibited by a nucleic acid selected from the group consisting of Seq ID Nos.: 1 to 
Seq. ID Nos.: 105 or by a homologous antisense nucleic acid. Identity or similarity 
may be determined using the FASTA version 3. Ot78 algorithm with the default 
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parameters. Alternatively, protein identity or similarity may be identified using 
BLASTP with the default parameters, BLASTX with the default parameters, or 
TBLASTN with the default parameters. (Altschul, S.F. et al. Gapped BLAST and PS1- 
BLAST: A New Generation of Protein Database Search Programs, Nucleic Acid Res. 

5 25:3389-3402 (1997). 

The term homologous polypeptide also includes polypeptides having at least 
99%, 95%, at least 90%, at least 85%, at least 80%, at least 70%, at least 60%, at 
least 50%, at least 40% or at least 25% amino acid identity or similarity to a 
polypeptide selected from the group consisting of Seq ID Nos.: 2 to Seq. ID Nos.: 

10 106 and polypeptides having at least 99%, 95%, at least 90%, at least 85%, at least 
80%, at least 70%, at least 60%, at least 50%, at least 40% or at least 25% amino 
acid identity or similarity to a fragment comprising at least 5, 10, 15, 20, 25, 30, 35, 
40, 5 0, 75, 100, or 1 50 consecutive amino acids of a polypeptide selected from the 
group consisting of Seq ID Nos.: 2 to Seq. ID Nos.: 106. 

15 The invention also includes polynucleotides, preferably DNA molecules, that 

hybridize to one of the nucleic acids of Seq ID Nos.: 2 to Seq. ID Nos.: 106 or the 
complements of any of the preceding nucleic acids. Such hybridization may be under 
stringent or moderate conditions as defined above or under other conditions which 
permit specific hybridization. The nucleic acid molecules of the invention that 

20 hybridize to these DNA sequences include oligodeoxynucleotides ("oligos") which 
hybridize to the target gene under highly stringent or stringent conditions. In general, 
for oligos between 14 and 70 nucleotides in length the melting temperature (Tm) is 
calculated using the formula: 

Tm ff) = 81 .5 + 1 6.6(log[monovalent cations (molar)] + 0.41 (%.G+Q -. (500N) 

25 where N is the length of the probe. If the hybridization is carried out in a solution 

containing formamide, the melting temperature may be calculated using the equation 

Tm(*C) = 81.5 + 16.6(log[monovalent cations (niolar)] + 0.4 1 (% G+C) - (0.6 
1) (% formamide) - (SOON) where N is the length of the probe. In general, 
hybridization is carried out at about 20-25 degrees below Tin (for DNA-DNA hybrids) 

30 or about 10-15 degrees below Tin (for RNA-DNA hybrids). 

Other hybridization conditions are apparent to those of skill in the art (see, for 
example, Ausubel, F.M. etal., eds., 1989, Current Protocols in Molecular Biology, 
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Vol. 1, Green Publishing Associates, Inc. and John Wiley & Sons, Inc., New York, at 
pp. 6.3.1-6.3.6 and 2.10.3. 

By "identifying a compound" is meant to screen one or more compounds in a 
collection of compounds such as a combinatorial chemical library or other library of 

5 chemical compounds or to characterize a single compound by testing the compound 
in a given assay and determining whether it exhibits the desired activity. 

By "inducer" is meant an agent or solution which, when placed in contact with 
a cell or microorganism, increases transcription, or inhibitor and/or promoter 
clearance/fidelity, from a desired promoter. 

10 As used herein, "nucleic acid" means DNA, RNA, or modified nucleic acids. 

Thus, the terminology ."the nucleic acid of SEQ ID NO: V or "the nucleic acid 
comprising the nucleotide sequence" includes both the DNA sequence of SEQ ID 
NO: X and an RNA sequence in which the thymidines in the DNA sequence have 
been substituted with uridines in the RNA sequence and in which the deoxyribose 

15 backbone "of the DNA sequence has been substituted with a ribose backbone in the 
RNA sequence. Modified nucleic acids are nucleic acids having nucleotides or 
structures which do not occur in nature, such as nucleic acids in which the 
intemucleotide phosphate residues with methylphosphonates, phosphorothioates, 
phosphoramidates, and phosphate esters. Nonphosphate intemucleotide analogs 

20 such as siloxane bridges, carbonate bridges, thioester bridges, as well as many 

others known in the art may also be used in modified nucleic acids. Modified nucleic 
acids may also comprise, (x-anomeric nucleotide units and modified micleotides such 
as 1 2 dideoxy-d-ribofuranose, 1,2-dideoxy- 1 -phenylribofuranose, and N4, N4- 
ethano-5 -methyl-cytosine are contemplated for use in the present invention. 

25 Modified nucleic acids may also be peptide nucleic acids in which the entire 

deoxyribose-phosphate backbone has been exchanged with a chemically completely 
different, but structurally homologous, polyamide (peptide) backbone containing 2- 
aminoethyl glycogen units. 

As used herein, "sub-lethal" means a concentration of an agent below the 

30 concentration required to inhibit all cell growth. 

A proliferation-required gene or gene family is one where, in the absence or 
substantial reduction of a gene transcript and/or gene product, growth or viability of 
the cell or microorganism is reduced or eliminated. Thus, as used herein, the 
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terminology "proliferation- required" or "required for proliferation" encompasses 
instances where the absence or substantial reduction of a gene transcript and/or 
gene product completely eliminates cell growth as well as instances where the 
absence of a gene transcript and/or gene product merely reduces cell growth. These 

5 proliferation-required genes can be used as potential targets for the generation of 
new antimicrobial agents. To achieve that goal, the present invention also 
encompasses assays for analyzing proliferation- required genes and for identifying 
compounds which interact with the gene and/or gene products of the proliferation- 
required genes. In addition, the present invention contemplates the expression of 

10 genes and the purification of the proteins encoded by the nucleic acid sequences 
identified as required proliferation genes and reported herein. The purified proteins 
can be used to generate reagents and screen small molecule libraries or other 
candidate compound libraries for compounds that can be further developed to yield 
novel antimicrobial compounds. 

15 The invention described herein addresses the need for identifying 

AUoiococcus otitidis proliferation-required gene or gene family that may be used to 
identify compounds, which are effective in preventing or treating most or all of the 
disease caused by AUoiococcus otitidis. The invention further addresses the need for 
methods of diagnosing AUoiococcus otitidis infection using the genes and the 

20 polypeptides identified herein. The inventors have identified novel AUoiococcus 
otitidis open reading frames (Ors), which encode proteins/polypeptides that are 
essential for the growth and proliferation of the bacteria. More particularly, the newly 
identified Ors encode polypeptides that are essential for proliferation of AUoiococcus 
otitidis, and thus serve as potential targets for antimicrobial compounds. Thus, in 

25 certain embodiments, the invention comprises AUoiococcus otitidis Ors encoding 

polypeptides that are essential for cellular proliferation, transcription gene products of 
AUoiococcus otitidis Ors, including, but not limited to mRNA, antisense RNA, 
antisense oligonucleotides, and ribozyme molecules, which can be used to inhibit or 
control growth of the microorganism. The invention relates also to methods of 

30 detecting AUoiococcus otitidis nucleic acids or polypeptides and kits for diagnosing 
AUoiococcus otitidis infection. The invention also relates to pharmaceutical . 
compositions, in particular antimicrobial compounds in pharmaceutical compositions, 
for the prevention and/or treatment of bacterial infection, in particular infection 
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caused by or exacerbated by Alloiococcus otitidis. 

B. Alloiococcus otitidis ORF Polynucleotides Encoding Polypeptides 
Essential for Proliferation 

5 

Isolated and purified Alloiococcus otitidis ORF polynucleotides of the present 
invention are contemplated for use in the production of Alloiococcus otitidis 
polypeptides. More specifically, in certain embodiments, the ORFs encode 
Alloiococcus otitidis polypeptides that are essential for cell proliferation. Thus, in one 

10 aspect, the present invention provides isolated and purified polynucleotides (ORFs) 
that encode Alloiococcus otitidis essential for cell proliferation. In particular 
embodiments, a polynucleotide of the present invention is a DNA molecule, wherein 
the DNA may be genomic DNA, plasmid DNA or cDNA. in a preferred embodiment, 
a polynucleotide of the present invention is a recombinant polynucleotide, which 

15 encodes an Alloiococcus otitidis polypeptide comprising an amino acid sequence that 
has at least 25% identity to an amino acid sequence of one of even numbered 
sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 106 or a fragment thereof. 
In another embodiment, an isolated and purified ORF polynucleotide comprises a 
nucleotide sequence that has at least 70% identity to one of the ORF polynucleotide 

20 nucleotide sequences set forth in SEQ ID NO: 1 through SEQ ID NO: 105, a 

degenerate. variant thereof, or a complement thereof. In yet another embodiment, an 
ORF polynucleotide of one of SEQ ID NO: 1 through SEQ ID NO: 105 is comprised 
in a plasmid vector and expressed in a host cell. In a preferred embodiment, the host 
eel! is a prokaryotic host cell. 

25 As used herein, the term "polynucleotide" means a sequence of nucleotides 

connected by phosphodiester linkages. Polynucleotides are presented herein in the 
direction from the 5' to the 3' direction. A polynucleotide of the present invention can 
comprise from about 10 to about several hundred thousand base pairs. Preferably, a 
polynucleotide comprises from about 10 to about 3,000 base pairs. Preferred lengths 

30 of particular polynucleotide are set forth hereinafter. 

A polynucleotide of the present invention can be a deoxyribonucleic acid 
(DNA) molecule, a ribonucleic acid (RNA) molecule, or analogs of the DNA or RNA 
generated using nucleotide analogs. The nucleic acid molecule can be single- 
stranded or double-stranded, but preferably is double-stranded DNA. Where a 
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polynucleotide is a DNA molecule, that molecule can be a gene, a cDNA molecule or 
a genomic DNA molecule. Nucleotide bases are indicated herein by a single letter 
code: adenine (A), guanine (G), thymine (T) and cytosine (C). 

"Isolated" means altered "by the hand of man" from the natural state. An 
5 "isolated" composition or substance is one that has been changed or removed from 
its original environment, or both. For example, a polynucleotide or a polypeptide 
naturally present in a living animal is not "isolated," but the same polynucleotide or 
polypeptide separated from the coexisting materials of its natural state is "isolated," 
as the term is employed herein. 

10 Preferably, an "isolated" polynucleotide is free of sequences which naturally 

flank the nucleic acid (i.e., sequences located at the 5' and 3' ends of the nucleic 
acid) in the genomic DNA of the organism from which the nucleic acid is derived. For 
example, in various embodiments, the isolated Alloiococcus otitidis nucleic acid 
molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0. 5 kb or 0. 1 kb of 

15 nucleotide sequences which naturally flank the nucleic acid molecule in genomic 
DNA of the cell from which the nucleic acid is derived. However, the Alloiococcus 
otitidis nucleic acid molecule can also be fused to heterologous protein encoding or 
regulatory sequences and still be considered isolated. 

ORF polynucleotides of the present invention may also be obtained using 

20 standard cloning and screening techniques from a cDNA library derived from mRNA. 
Polynucleotides of the invention can also be obtained from natural sources such as 
genomic DNA libraries {e.g., an Alioiococcus otitidis library) or can be synthesized 
using well-known and commercially available techniques. As contemplated in the 
present invention, ORF polynucleotides are obtained using Alloiococcus otitidis 
. 25 chromosomal DNA as the template. 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequences set forth in the odd numbered sequences listed in ID NO: 1 
through SEQ ID NO: 105 (and fragments thereof) due to degeneracy of the genetic 
code, and thus encode the same Alloiococcus otitidis polypeptides as those encoded 

30 by the amino acid sequences shown in even numbered sequences set forth in SEQ 
ID NO:2 through SEQ ID NO: 106 

Orthologs and allelic variants of the Alioiococcus otitidis polynucleotides are 
readily identified using methods well known in the art. An allelic variant or an 
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orthologue of the polynucleotides comprises a nucleotide sequence that is typically at 
least about 70-75%, more typically at least about 80-85%, and most typically at least 
about 90-95% or more homologous to the nucleotide sequence shown in one of the 
odd numbered sequences set forth in SEQ ID NO:1 through SEQ ID NO: 105, or a 
5 fragment of these nucleotide sequences. Such nucleic acid molecules are readily 
identified as being able to hybridize, preferably under stringent conditions, to the 
nucleotide sequence shown in one of the odd numbered sequences set forth in SEQ 
ID NO:1 through SEQ ID NO: 105, or a fragment of these nucleotide sequences. 

Moreover, the polynucleotides of the invention can comprise only a fragment 
10 of the coding region of an Alloiococcus otitidis polynucleotide or gene, such as a 
fragment of one of the odd numbered sequences set forth in SEQ ID NO:1 through 
SEQ ID NO: 105. 

When the ORF polynucleotides of the invention are used for the recombinant 
production of Alloiococcus otitidis polypeptides of the present invention, the 

15 polynucleotide may include the coding sequence for the mature polypeptide, by itself, 
or the coding sequence for the mature polypeptide in reading frame with other coding 
sequences, such as those encoding a leader or secretory sequence, a pre-, or pro- 
or prepro- protein sequence, or other fusion peptide portions. For example, a marker 
sequence which facilitates purification of the fused polypeptide can be linked to the 

20 coding sequence (seeGentz etal., 1989, incorporated herein by reference). Thus, 
contemplated in the present invention is the preparation of polynucleotides encoding 
fusion polypeptides permitting His-tag purification of expression products. The 
polynucleotide may also contain non-coding 5' and 3' sequences, such as 
transcribed, non-translated sequences, splicing and polyadenylation signals. 

25 Thus, a polynucleotide encoding a polypeptide of the present invention, 

including homologs and orthologs from species other than Alloiococcus otitidis, may 
be obtained by a process which comprises the steps of screening an appropriate 
library under stringent hybridization conditions with a labeled probe having the 
sequence of one of the odd numbered sequences set forth in SEQ ID NO:1 through 

30 SEQ ID NO: 1 05 or a fragment thereof; and isolating full-length cDNA and genomic 
clones containing the polynucleotide sequence. Such hybridization techniques are 
well known to the skilled artisan. The skilled artisan will appreciate that, in many 
cases, an isolated cDNA sequence will be incomplete, in that the region coding for 
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the polypeptide is cut short at the 5' end of the cDNA. This is a consequence of 
reverse transcriptase, an enzyme with inherently low "processivity" (a measure of the 
ability of the enzyme to remain attached to the template during the polymerization 
reaction), failing to complete a DNA copy of the mRNA template during the first- 

5 strand cDNA synthesis. 

Thus, in certain embodiments, the polynucleotide sequence information 
provided by the present invention allows for the preparation of relatively short DNA 
(or RNA) oligonucleotide sequences having the ability to specifically hybridize to 
gene sequences of the selected polynucleotides disclosed herein. The term 

10 "oligonucleotide" as used herein is defined as a molecule comprised of two or more 
deoxyribonucleotides or ribonucleotides, usually more than three (3), and typically • 
more than ten (10) and up to one hundred (100) or more (although preferably 
between twenty and thirty). The exact size will depend on many factors, which in 
turn depends on the ultimate function or use of the oligonucleotide. Thus, in 

15 particular embodiments of the invention, nucleic acid probes of an appropriate length 
are prepared based on a consideration of a selected nucleotide sequence, e.g., a 
sequence such as that shown in one of the odd numbered sequences set forth in 
SEQ ID NO:1 through SEQ ID NO: 105. The ability of such nucleic acid probes to 
specifically hybridize to a polynucleotide encoding an Alloiococcus otitidis 

20 polypeptide lends them particular utility in a variety of embodiments. Most 

importantly, the probes can be used in a variety of assays for detecting the presence 
of complementary sequences in a given sample. 

In certain embodiments, it is advantageous to use oligonucleotide primers. 
These primers are generated in any manner, including chemical synthesis, DNA 

25 replication, reverse transcription, or a combination thereof. The sequence of such 
primers is designed using a polynucleotide of the present invention for use in 
detecting, amplifying or mutating a defined segment of an ORF polynucleotide that 
encodes an Alloiococcus otitidis polypeptide from prokaryotic cells using polymerase 
chain reaction (PCR) technology. 

30 In certain embodiments, it is advantageous to employ a polynucleotide of the 

present invention in combination with an appropriate label for detecting hybrid 
formation. A wide variety of appropriate labels are known in the art, including 
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radioactive, enzymatic or other ligands, such as avidin/biotin, which are capable of 
giving a detectable signal. 

Polynucleotides which are identical or sufficiently identical to a nucleotide 
sequence contained in one of the odd numbered sequences set forth in SEQ ID 

5 NO:1 through SEQ ID NO: 105, or a fragment thereof, may be used as hybridization 
probes for cDNA and genomic DNA or as primers for a nucleic acid amplification 
(PCR) reaction, to isolate full-length cDNAs and genomic clones encoding 
polypeptides of the present invention and to isolate cDNA and genomic clones of 
other genes (including genes encoding homologs and orthologs from species other 

10 than Alloiococcus otitidis) that have a high sequence similarity to polynucleotide 

sequences set forth in one of the odd numbered sequences set forth in SEQ ID NO:1 
through SEQ ID NO:1 05, or a fragment thereof. Typically these nucleotide 
sequences are from at least 70% identical to at least about 95% identical to that of 
the reference polynucleotide sequence. The probes or primers will generally 

15 comprise at least 15 nucleotides, preferably, at least 30 nucleotides and may have at 
least 50 nucleotides. Particularly preferred probes will have between 30 and 50 
nucleotides. 

There are several methods available and well known to those skilled in the art 
to obtain full-length cDNAs, or extend short cDNAs, for example those based on the 

20 method of Rapid Amplification of cDNA ends (RACE) (see, Frohman et at., 1 988). 
Recent modifications of the technique, exemplified by the Marathon™ technology 
[Promega, Madison, Wl], for example, have significantly simplified the search for 
longer cDNAs. In the Marathon™ technology, cDNAs have been prepared from 
mRNA extracted from a chosen tissue and an "adaptor" sequence ligated onto each 

25 end. Nucleic acid amplification (PCR) is then carried out to amplify the "missing" 5' 
end of the cDNA using a combination of gene specific and adaptor specific 
oligonucleotide primers. The PCR reaction is then repeated using "nested" primers, 
that is, primers designed to anneal within the amplified product (typically an adaptor 
specific primer that anneals further 3' in the adaptor sequence and a gene specific 

30 primer that anneals further 5 1 in the known gene sequence). The products of this 
reaction are then analyzed by DNA sequencing and a full-length cDNA constructed 
either by joining the product directly to the existing cDNA to give a complete 
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sequence, or carrying out a separate full-length PCR using the new sequence 
information for the design of the 5' primer. 

To provide certain of the advantages in accordance with the present 
invention, a preferred nucleic acid sequence employed for hybridization studies or 
5 assays includes probe molecules that are complementary to at least a 1 0 to about 70 
nucleotides long stretch of a polynucleotide that encodes an Adoiococcus otitidis 
polypeptide, such as that shown in one of the even numbered sequences set forth in 
SEQ ID NO: 2 through SEQ ID NO: 106. A size of at least 10 nucleotides in length 
helps to ensure that the fragment will be of sufficient length to form a duplex 
10 molecule that is both stable and selective. Molecules having complementary 

sequences over stretches greater than 10 bases in length are generally preferred in 
order to increase stability and selectivity of the hybrid, and thereby improve the 
quality and degree of specific hybrid molecules obtained. It is generally preferable to 
design nucleic acid molecules with gene-complementary stretches of 25 to 40 
15 nucleotides, 55 to 70 nucleotides, or even longer where desired. For example, such 
fragments are readily prepared by directly synthesizing the fragment by chemical 
means, by application of nucleic acid reproduction technology, such as the PCR 
technology (U.S. Patent 4,683,202, incorporated herein by reference), or by excising 
selected DNA fragments from recombinant plasmids containing appropriate inserts 
20 and suitable restriction enzyme sites. 

In another aspect, the present invention contemplates an isolated and purified 
polynucleotide comprising a nucleotide sequence that is identical or complementary 
to a segment of at least 10 contiguous bases of one of the odd numbered sequences 
set forth in SEQ ID NO: 1 through SEQ ID NO: 105, wherein the polynucleotide 
25 hybridizes to a polynucleotide that encodes an Alloiococcus otitidis polypeptide. 

Preferably, the isolated and purified polynucleotide comprises a base sequence that 
is identical or complementary to a segment of at least 25 to 70 contiguous bases of 
one of the odd numbered sequences set forth in SEQ ID NO: 1 through SEQ ID NO: 
105. For example, the polynucleotide of the invention can comprise a segment of 
30 bases identical or complementary to from 40 to 55 contiguous bases of the disclosed 
nucleotide sequences. 

Accordingly, a polynucleotide probe molecule of the invention can be used for 
its ability to selectively form duplex molecules with complementary stretches of the 
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gene. Depending on the application envisioned t varying conditions of hybridization 
are employed to achieve varying degrees of selectivity of the probe toward the target 
sequence. For applications requiring a high degree of selectivity, relatively stringent 
conditions are employed to form the hybrids. Of course, for some applications, for 

5 example, where one desires to prepare mutants employing a mutant primer strand 
hybridized to an underlying template or where one seeks to isolate an Alloiococcus 
otitidis homologous polypeptide coding sequence from other cells, functional 
equivalents, or the like, less stringent hybridization conditions are typically needed to 
allow formation of the heteroduplex (see Table 2). Cross-hybridizing species are 

10 thereby readily identified as positively hybridizing signals with respect to control 

hybridizations. Thus, hybridization conditions are readily manipulated, and thus will 
generally be a method of choice depending on the desired results. 

Of course, for some applications, for example, where one desires to prepare 
mutants employing a mutant primer strand hybridized to an underlying template or 

15 where one seeks to isolate a homologous polypeptide coding sequence from other 
cells, functional equivalents, or the like, less stringent hybridization conditions are 
typically needed to allow formation of the heteroduplex. Cross-hybridizing species 
are thereby readily identified as positively hybridizing signals with respect to control 
hybridizations. In any case, it is generally appreciated that conditions can be 

20 rendered more stringent by the addition of increasing amounts of formamide, which 
serves to destabilize the hybrid duplex in the same manner, as increased 
temperature. Thus, hybridization conditions are readily manipulated, and thus are 
generally a method of choice depending on the desired results. 

The present invention also includes polynucleotides capable of hybridizing 

25 under reduced stringency conditions, more preferably stringent conditions, and most 
preferably highly stringent conditions, to polynucleotides described herein. Examples 
of stringency conditions are shown in the table below: highly stringent conditions are 
those that are at least as stringent as, for example, conditions A-F; stringent 
conditions are at least as stringent as, for example, conditions G-L; and reduced 

30 stringency conditions are at least as stringent as, for example, conditions M-R. 
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Table 2 
Stringency Conditions 



stringency 
Condition 


roiynucicoiiutr 

Hybrid 


Hwhriri 1 pnntH 
riyuiiu lciimui 

(bp)' 


Hvbridizatton 
Temperature and 
Buffer" 


Wash TemDerature and 
BufferH 


A 






65°C* 1xSSC -or- 
42 °C; 1xSSC, 50% 
formamide 


65 °C* 0 3xSSC 


B 


DNA:DNA 


<50 


T B ; 1xSSC 


T B ; 1xSSC 


C 


DNA:RNA 


>50 


67 °C; 1xSSC -or- 
45 °C; 1xSSC, 50% 
formamide 


67 °C; 0.3xSSC 


D 


DNA:RNA 


<50 


T D ; 1xSSC 


T D ; 1xSSC 


E 


RNA:RNA 


>50 


70 °C; 1xSSC -or- 
50 °C; 1xSSC, 50% 

forma mirip 

1 \J 1 1 i Idl 1 llvIC 


70 °C; 0.3xSSC 


F 


RNA:RNA 


<50 


T F ; 1xSSC 


T F ; 1xSSC 


G 


DNA:DNA 


>50 


65 °C; 4xSSC -or- 
42°C;4xSSC, 50% 
formamide 


65 °C; txSSC 


H 


DNA:DNA 


< 50 


T H ; 4xSSC 


T H ; 4xSSC 


1 


DNA:RNA 


> 50 


67 O, 4XooO -or- 
45 °C; 4xSSC,50% 
formamide 




J 


DNA:RNA 


<50 


Tj; 4xSSC 


Tj; 4xSSC 


K 


RNA:RNA 


> 50 


70 C; 4xSSC -or- 
50 EC; 4xSSC, 50% 
formamide 


67 C t ixSSC 


L 


RNArRNA 


<50 


T L ; 2xSSC 


T L ; 2xSSC 


M 


DNA:DNA 


>50 


50 °C; 4xSSC -or- 
40 °C; 6xSSC, 50% 
formamide 


50 °C; 2xSSC 


N 


DNA:DNA 


<50 


T N ; 6xSSC 


Tn; 6xSSC 


O 


DNA:RNA 


>50 


55°C;4xSSC -or- 
42 °C; 6xSSC, 50% 
formamide 


55°C;2xSSC 


P 


DNA:RNA 


<50 


T P ; 6xSSC 


T P ; 6xSSC 


Q 


RNAiRNA 


>50 


60 °C;4xSSC -or- 
45 °C; 6xSSC, 50% 
formamide 


60 °C; 2xSSC 


R 


RMA:RNA 


<50 


T R ; 4xSSC 


T R ; 4xSSC 



(bp) 1 : The hybrid length is that anticipated for the hybridized region(s) of the 
hybridizing polynucleotides. When hybridizing a polynucleotide to a target 
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polynucleotide of unknown sequence, the hybrid length is assumed to be that of the 
hybridizing polynucleotide. When polynucleotides of known sequence are 
hybridized, the hybrid length can be determined by aligning the sequences of the 
polynucleotides and identifying the region or regions of optimal sequence 
5 complementarity. 

Buffer*" 1 : SSPE (IxSSPE is 0.1 5M NaCI, 10mM NaH 2 P0 4 , and 1.25mM EDTA, 
pH 7.4), can be substituted for SSC (1xSSC is 0.1 5M NaCi and 15mM sodium 
citrate) in the hybridization and wash buffers; washes are performed for 15 minutes 
after hybridization is complete. 
10 T B through T R : The hybridization temperature for hybrids anticipated to be 

less than 50 base pairs in length should be 5-1 0EC less than the melting temperature 
(T m ) of the hybrid, where T m is determined according to the following equations. For 
hybrids less than 1 8 base pairs in length, T m (EC) = 2(# of A + T bases) + 4(# of G + 
C bases). For hybrids between 18 and 49 base pairs in length, T m (EC) = 81 .5 + 
15 16.6(iog 10 [Na + ]) + 0.41 (%G+C) - (600/N), where N is the number of bases in the 

hybrid, and [Na + ] is the concentration of sodium ions in the hybridization buffer ([Na + ] 
for 1xSSC = 0.165 M). 

Additional examples of stringency conditions for polynucleotide hybridization 
are provided in Sambrook etaL, 1989, Molecular Cloning: A Laboratory Manual, Cold 
20 Spring Harbor Laboratory Press, Cold Spring Harbor, NY, chapters 9 and 1 1 , and 
Ausubel etaL, 1995, Current Protocols in Molecular Biology, Eds., John Wiley & 
Sons, Inc., sections 2.10 and 6.3-6.4, incorporated herein by reference. 

In addition to the nucleic acid molecules encoding Alfoiococcus otitidis 
polypeptides described above, another aspect of the invention pertains to isolated 
25 nucleic acid molecules that are antisense thereto. An "antisense" nucleic acid 

comprises a nucleotide sequence that is complementary to a "sense" nucleic acid 
encoding a protein, e.g., complementary to the coding strand of a double-stranded 
cDNA molecule or complementary to an mRNA sequence. Accordingly, an antisense 
nucleic acid can hydrogen bond to a sense nucleic acid. The antisense nucleic acid 
30 can be complementary to an entire Alloiococcus otitidis coding strand, or to only a 
fragment thereof. In one embodiment, an antisense nucleic acid molecule is 
antisense to a "coding region" of the coding strand of a nucleotide sequence 
encoding an Alloiococcus otitidis polypeptide. 
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The term "coding region" refers to the region of the nucleotide sequence 
comprising codons which are translated into amino acid residues, e.g., the entire 
coding region of each of the odd numbered sequences set forth in SEQ ID NO: 1 
through SEQ ID NO: 105. In another embodiment, the antisense nucleic acid 
5 molecule is antisense to a "noncoding region" of the coding strand of a nucleotide 
sequence encoding an AHoiococcus otitidis polypeptide. The term "noncoding 
region" refers to 5' and 3' sequences which flank the coding region that are not 
translated into amino acids {i.e., also referred to as 5* and 3' untranslated regions). 
Given the coding strand sequence encoding the AHoiococcus otitidis 
10 polypeptides disclosed herein antisense nucleic acids of the invention can be 

designed according to the rules of Watson and Crick base pairing. The antisense 
nucleic acid molecule can be complementary to the entire coding region of 
AHoiococcus otitidis mRNA, but more preferably is an oligonucleotide which is 
antisense to only a fragment of the coding or noncoding region of AHoiococcus otitidis 
15 mRNA. For example, the antisense oligonucleotide can be complementary to the 
region surrounding the translation start site of AHoiococcus otitidis mRNA. 

An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 
35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can 
be constructed using chemical synthesis and enzymatic ligation reactions using 
20 procedures known in the art. For example, an antisense nucleic acid (e.g., an 

antisense oligonucleotide) can be chemically synthesized using naturally occurring 
nucleotides or variously modified nucleotides designed to increase the biological 
stability of the molecules or to increase the physical stability of the duplex formed 
between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives 
25 and acridine substituted nucleotides can be used. Examples of modified nucleotides 
which can be used to generate the antisense nucleic acid include 5-fluorouracil, 5- 
bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- 
carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
30 N6-isopentenyladenine, i-methylguanine, l-methylinosine, 2,2-dimethylguanine, 2- 

methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methyiaminomethyluracil, 5-methoxyaminomethyl-2-thiouracii, 
beta-D-mannosylqueosine, 5-methoxycarboxymethyluracil, 5-methoxyuracil, 2- 
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methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, S-methyl-2-thiouracil, 2-thiouraciI, 4- 
thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, uracii-5-oxyacetic acid 
(v), 5-methyl-2-thiouraciI, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6- 

5 diaminopurine. 

Alternatively, the antisense nucleic acid can be produced biologically using an 
expression vector into which a nucleic acid has been subcloned in an antisense 
orientation {i.e., RNA transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of interest, described further in the 

10 following subsection). 

The antisense nucleic acid molecules of the invention are typically 
administered to a subject or generated in situ such that they hybridize with or bind to 
cellular mRNA and/or genomic DNA encoding an Alloiococcus otitidis polypeptide to 
thereby inhibit expression of the polypeptide, e.g., by inhibiting transcription and/or 

15 translation. The hybridization can be by conventional nucleotide complementarity to 
form a stable duplex, or, for example, in the case of an antisense nucleic acid 
molecule which binds to DNA duplexes, through specific interactions in the major 
groove of the double helix. An example of a route of administration of an antisense 
nucleic acid molecule of the invention includes direct injection at a tissue site. 

20 Alternatively, an antisense nucleic acid molecule can be modified to target selected 
cells and then administered systemically. For example, for systemic administration, 
an antisense molecule can be modified such that it specifically binds to a receptor or 
an antigen expressed on a selected cell surface, e.g., by linking the antisense nucleic 
acid molecule to a peptide or an antibody which binds to a cell surface receptor or 

25 antigen. The antisense nucleic acid molecule can also be delivered to cells using the 
vectors described herein. 

In yet another embodiment, the antisense nucleic acid molecule of the 
invention is an a-anomeric nucleic acid molecule. An a-anomeric nucleic acid 
molecule forms specific double-stranded hybrids with complementary RNA in which, 

30 contrary to the usual y-unrts, the strands run parallel to each other (Gaultier et ai. t 
1987). The antisense nucleic acid molecule can also comprise a 2'-o- 
methylribonucleotide (Inoue et a/., 1987) or a chimeric RNA-DNA analogue (Inoue et 
ai t 1987). 
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In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity that are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they 
have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes 

5 described in Haselhoff and Gerlach, 1 988) can be used to catalyticaily cleave 

Alloiococcus otitidis mRNA transcripts to thereby inhibit translation of Alloiococcus 
otitidis mRNA. A ribozyme having specificity for an Alloiococcus of/f/d/s-encoding 
nucleic acid can be designed based upon the nucleotide sequence of an 
Alloiococcus otitidis cDNA disclosed herein. For example, a derivative of a 

10 Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of 
the active site is complementary to the nucleotide sequence to be cleaved in an 
Alloiococcus of/f/d/s-encoding mRNA. See, e.g., Cech et a/. U.S. 4,987,071 and 
Cech etal. U.S. 5,116,742 both incorporated herein in their entirety by reference. 
Alternatively, Alloiococcus otitidis mRNA can be used to select a catalytic RNA 

15 having a specific ribonuclease activity from a pool of RNA molecules. See, e.g., 
Bartel and Szostak, 1993. 

Alternatively Alloiococcus otitidis gene expression can be inhibited by 
targeting nucleotide sequences complementary to the regulatory region of the 
Alloiococcus otitidis gene (e.g., the Alloiococcus otitidis gene promoter and/or 

20 enhancers) to form triple helical structures that prevent transcription of the 

Alloiococcus otitidis gene in target cells. See generally, Helene, 1991; Helene etal., 
1 992; and Maher, 1 992. 

Alloiococcus otitidis gene expression can also be inhibited using RNA 
interference (RNAi). This is a technique for post-transcriptional gene silencing 

25 (PTGS), in which target gene activity is specifically abolished with cognate double- 
stranded RNA (dsRNA). RNAi resembles in many aspects PTGS in plants and has 
been detected in many invertebrates including trypanosome, hydra, planaria, 
nematode and fruit fly (Drosophila melangnoster). It may be involved in the 
modulation of transposable element mobilization and antiviral state formation. RNAi 

30 in mammalian systems is disclosed in WO 00/63364, which is incorporated by 

reference herein in its entirety. Basically, dsRNA of at least about 600 nucleotides, 
homologous to the target is introduced into the cell and a sequence specific reduction 
in gene activity is observed. 
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C. Alloiococcus otitidis Polypeptides 

In particular embodiments, the present invention provides isolated and 
5 purified Alloiococcus otitidis polypeptides. Preferably, an Alloiococcus otitidis 

polypeptide of the invention is a recombinant polypeptide. In certain embodiments, 
an Alloiococcus otitidis polypeptide of the present invention comprises the amino acid 
sequence that has at least 25% identity to the amino acid sequence of one of the 
even numbered sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 106, a 
10 biological equivalent thereof, or a fragment thereof. 

An Alloiococcus otitidis polypeptide according to the present invention 
encompasses a polypeptide that comprises: 1) the amino acid sequence shown in 
one of the even numbered sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 
106) functional and non-functional naturally occurring variants or biological 
15 equivalents of Alloiococcus otitidis polypeptides of the even numbered sequences set 
forth in SEQ ID NO: 2 through SEQ ID NO: 106 and recombinantly produced variants 
or biological equivalents of Alloiococcus otitidis polypeptides set out in SEQ ID NO: 2 
through SEQ ID NO: 106) polypeptides isolated from organisms other than 
Alloiococcus otitidis (orthologs of Alloiococcus otitidis polypeptides.) 

20 A biological equivalent or variant of an Alloiococcus otitidis polypeptide 

according to the present invention encompasses 1) a polypeptide isolated from 
Alloiococcus otitidis] and 2) a polypeptide that contains substantial homology to an 
Alloiococcus otitidis polypeptide. 

Biological equivalents or variants of Alloiococcus otitidis include both 

25 functional and non-functional Alloiococcus otitidis polypeptides. Functional biological 
equivalents or variants are naturally occurring amino acid sequence variants of an 
Alloiococcus otitidis polypeptide that maintain the ability to elicit an immunological or 
antigenic response in a subject. Functional variants will typically contain only 
conservative substitutions of one or more amino acids in any one of even numbered 

30 sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 106 or substitution, 

deletion or insertion of non-critical residues in non-critical regions of the polypeptide. 

The present invention further provides non-/A//o/ococcus otitidis orthologues of 
Alloiococcus otitidis polypeptides. Orthologues of Alloiococcus otitidis polypeptides 
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are polypeptides that are isolated from non-Moiococcus otitidis organisms and 
possess antigenic capabilities of the Alloiococcus otitidis polypeptide. Orthologues of 
an Alloiococcus otitidis polypeptide can readily be identified as comprising an amino 
acid sequence that is substantially homologous to one of the even numbered 

5 sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 106. 

Modifications and changes can be made in the structure of a polypeptide of 
the present invention and still obtain a molecule having Alloiococcus otitidis 
antigenicity. For example, certain amino acids can be substituted for other amino 
acids in a sequence without appreciable loss of antigenicity. Because it is the 

10 interactive capacity and nature of a polypeptide that defines that polypeptide's 

biological functional activity, certain amino acid sequence substitutions can be made 
in a polypeptide sequence (or, of course, its underlying DNA coding sequence) and 
nevertheless obtain a polypeptide with like properties. 

In making such changes, the hydropathic index of amino acids can be 

15 considered. The importance of the hydropathic amino acid index in conferring 

interactive biologic function on a polypeptide is generally understood in the art (Kyte 
& Doolittle, 1 982). It is known that certain amino acids can be substituted for other 
amino acids having a similar hydropathic index or score and still result in a 
polypeptide with similar biological activity. Each amino acid has been assigned a 

20 hydropathic index on the basis of its hydrophobicity and charge characteristics. 
Those indices are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine 
(+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); 
threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1 .3); proline (-1.6); 
histidine (-3.2); glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); 

25 lysine (-3.9); and arginine (-4.5). 

It is believed that the relative hydropathic character of the amino acid residue 
determines the secondary and tertiary structure of the resultant polypeptide, which in 
turn defines the interaction of the polypeptide with other molecules, such as 
enzymes, substrates, receptors, antibodies, antigens, and the like. It is known in the 

30 art that an amino acid can be substituted by another amino acid having a similar 
hydropathic index and still obtain a functionally equivalent polypeptide. In such 
changes, the substitution of amino acids whose hydropathic indices are within +/-2 is 
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preferred, those within +/-1 are particularly preferred, and those within +/-0.5 are 
even more particularly preferred. 

Substitution of like amino acids can also be made on the basis of 
hydrophilicity, particularly where the biologically functional equivalent polypeptide or 

5 peptide thereby created is intended for use in immunological embodiments. U.S. Pat. 
No. 4,554,101 , incorporated herein by reference, states that the greatest local 
average hydrophilicity of a polypeptide, as governed by the hydrophilicity of its 
adjacent amino acids, correlates with its immunogenicity and antigenicity, i.e. with a 
biological property of the polypeptide. 

10 As detailed in U.S. Pat. No. 4,554,1 01 , the following hydrophilicity values 

have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate 
(+3.0 ±1); glutamate (+3.0 ±1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); 
glycine (0); proline (-0.5 ±1); threonine (-0.4); alanine (-0.5); histidine (-0.5); cysteine 
(-1 .0); methionine (-1 .3); valine (-1 .5); leucine (-1 .8); isoleucine (-1 .8); tyrosine (-2.3); 

15 phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be 
substituted for another having a similar hydrophilicity value and still obtain a 
biologically equivalent, and in particular, an immunologically equivalent polypeptide. 
In such changes, the substitution of amino acids whose hydrophilicity values are 
within ±2 is preferred, those which are within ±1 are particularly preferred, and those 

20 within ±0.5 are even more particularly preferred. 

As outlined above, amino acid substitutions are generally therefore based on 
the relative similarity of the amino acid side-chain substituents, for example, their 
hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions 
which take various of the foregoing characteristics into consideration are well known 

25 to those of skill in the art and include: arginine and lysine; glutamate and aspartate; 
serine and threonine; glutamine and asparagine; and valine, leucine and isoleucine 
(See Table 3, below). The present invention thus contemplates functional or 
biological equivalents of an Alldiococcus otitidis polypeptide as set forth above. 
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TABLE 3: 
AMINO ACID SUBSTITUTIONS 



Original Residue Exemplary Residue 

Substitution 



Ala 


Gly; Ser 


Arg 


Lys 


Asn 


Gin; His 


Asp 


Glu 


Cys 


Ser 


Gin 


Asn 


Glu 


Asp 


Gly 


Ala 


His 


Asn; Gin 


tie 


Leu; Val 


Leu 


!le; Va! 


Lys 


Arg 


Met 


Met; Leu; Tyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


Trp; Phe 


Val 


He; Leu 



Biological or functional equivalents of a polypeptide are also prepared using 
5 site-specific mutagenesis. Site-specific mutagenesis is a technique useful in the 
preparation of second generation polypeptides, or biologically functional equivalent 
polypeptides or peptides, derived from the sequences thereof, through specific 
mutagenesis of the underlying DNA. As noted above, such changes can be 
desirable where amino acid substitutions are desirable. The technique further 

10 provides a capacity to prepare and test sequence variants, for example, incorporating 
one or more of the foregoing considerations, by introducing one or more nucleotide 
sequence changes into the DNA. Site-specific mutagenesis allows the production of 
mutants through the use of specific oligonucleotide sequences which encode the 
DNA sequence of the desired mutation, as well as a sufficient number of adjacent 

15 nucleotides, to provide a primer sequence of sufficient size and sequence complexity 
to form a stable duplex on both sides of the deletion junction being traversed. 
Typically, a primer of about 17 to 25 nucleotides in length is preferred, with about 5 to 
1 0 residues on both sides of the site of the alteration of the sequence. 

In general, the technique of site-specific mutagenesis is well known in the art. 

20 As will be appreciated, the technique typically employs a phage vector, that can exist 
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in both a single stranded and double stranded form. Typically, site-directed 
mutagenesis in accordance herewith is performed by first obtaining a single-stranded 
vector which includes within its sequence a DNA sequence which encodes all or a 
portion of the Alloiococcus otitidis polypeptide sequence selected. An 
5 oligonucleotide primer bearing the desired mutated sequence is prepared (e.g., 
synthetically). This primer is then annealed to the singled-stranded vector, and 
extended by the use of enzymes such as Escherichia coii polymerase I Kienow 
fragment, in order to complete the synthesis of the mutation-bearing strand. Thus, a 
heterodupiex is formed wherein one strand encodes the original non-mutated 

10 sequence and the second strand bears the desired mutation. This heterodupiex 
vector is then used to transform appropriate cells such as Escherichia coii cells and 
clones are selected which include recombinant vectors bearing the mutation. 
Commercially available kits come with all the reagents necessary, except the 
oligonucleotide primers. 

15 An Alioiococcus otitidis polypeptide or polypeptide antigen of the present 

invention is understood to be any Alioiococcus otitidis polypeptide comprising 
substantial sequence similarity, structural similarity and/or functional similarity to an 
Ailoiococcus otitidis polypeptide comprising the amino acid sequence of one of the 
even numbered sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 106. In 

20 addition, an Alioiococcus otitidis polypeptide or polypeptide antigen of the invention is 
not limited to a particular source. Thus, the invention provides for the general 
detection and isolation of the polypeptides from a variety of sources. 

It is contemplated in the present invention, that an Alloiococcus otitidis 
polypeptide may advantageously be cleaved into fragments for use in further 

25 structural or functional analysis, or in the generation of reagents such as Alloiococcus 
otitidis-relaXed polypeptides and Alloiococcus of/f/cf/s-specific antibodies. This can be 
accomplished by treating purified or unpurified Alioiococcus otitidis polypeptides with 
a peptidase such as endoproteinase glu-C (Boehringer, Indianapolis, IN). Treatment 
with CNBr is another method by which peptide fragments may be produced from 

30 natural Ailoiococcus otitidis polypeptides. Recombinant techniques also can be used 
to produce specific fragments of an Alloiococcus otitidis polypeptide. 

In addition, the inventors also contemplate that compounds sterically similar 
to a particular Alloiococcus otitidis polypeptide antigen, called peptidomimetics, may 
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be formulated to mimic the key portions of the peptide structure. Peptidemimetics 
are peptide-containing molecules that mimic elements of protein secondary structure. 
(See, for example, Johnson ef a/., 1993.) The underlying rationale behind the use of 
peptide mimetics is that the peptide backbone of proteins exists chiefly to orient 
5 amino acid side chains in such a way as to facilitate molecular interactions, such as 
those of receptor and ligand. 

Successful applications of the peptide mimetic concept have thus far focused 
on mimetics of p-turns within proteins. Likely p-tum structures, within Alloiococcus 
otitidis, can be predicted by computer-based algorithms as discussed above. Once 
10 the component amino acids of the turn are determined, mimetics can be constructed 
to achieve a similar spatial orientation of the essential elements of the amino acid 
side chains, as discussed in Johnson et a/., 1993. 

Fragments of the Alloiococcus otitidis polypeptides are also included in the 
invention. A fragment is a polypeptide having an amino acid sequence that entirely is 
15 the same as a part, but not all, of the amino acid sequence. The fragment can 
comprise, for example, at least 7 or more (e.g., 8, 10 12, 14, 16, 18, 20 or more) 
contiguous amino acids of an one of amino acid sequence selected from one of the 
even numbered sequences set forth in SEQ ID NO.: 2 through SEQ ID NO.: 106. 
Fragments may be "freestanding" or comprised within a larger polypeptide of which 
20 they form a part or region, most preferably as a single, continuous region. In one 
embodiment, the fragments include at least one epitope of the mature polypeptide 
sequence. 

"Fusion protein" refers to a protein encoded by two, often unrelated, fused 
genes or fragments thereof. For example, fusion proteins comprising various 

25 portions of constant region of immunoglobulin molecules together with another 

human protein or part thereof have been described. In many cases, employing an 
immunoglobulin Fc region as a part of a fusion protein is advantageous for use in 
therapy and diagnosis resulting in, for example, improved pharmacokinetic properties 
(see, e.g., EP-A 0232 2621 ). On the other hand, for some uses it would be desirable 

30 to be able to delete the Fc part after the fusion protein has been expressed, detected 
and purified. 
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D. Alloiococcus otitidis Polynucleotide and Polypeptide Variants 

"Variant" as the term is used herein, is a polynucleotide or polypeptide that 
differs from a reference polynucleotide or polypeptide respectively, but retains 

5 essential properties. A typical variant of a polynucleotide differs in nucleotide 
sequence from another, reference polynucleotide. Changes in the nucleotide 
sequence of the variant may or may not alter the amino acid sequence of a 
polypeptide encoded by the reference polynucleotide. Nucleotide changes may 
result in amino acid substitutions, additions, deletions, fusions and truncations in the 

10 polypeptide encoded by the reference sequence, as discussed below. A typical 
variant of a polypeptide differs in amino acid sequence from another, reference 
polypeptide. Generally, differences are limited so that the sequences of the 
reference polypeptide and the variant are closely similar overall and, in many 
regions, identical. A variant and reference polypeptide may differ in amino acid 

15 sequence by one or more substitutions, additions and deletions in any combination. 
A substituted or inserted amino acid residue may or may not be one encoded by the 
genetic code. A variant of a polynucleotide or polypeptide may be a naturally 
occurring variant such as an allelic variant, or it may be a variant that is not known to 
occur naturally. Non-naturally occurring variants of polynucleotides and polypeptides 

20 may be made by mutagenesis techniques or by direct synthesis. 

"Identity," as known in the art, is a relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by 
comparing the sequences. In the art, "identity" also means the degree of sequence 
relatedness between polypeptide or polynucleotide sequences, as the case may be, 

25 as determined by the match between strings of such sequences. "Identity" can be 
readily calculated by known methods, including but not limited to those described in 
(Computational Molecular Biology, Lesk, A. M., ed., Oxford University Press, New 
York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., 
Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I, 

30 Griffin, A. M., and Griffin, H. G., eds., Humana Press, New Jersey, 1994; Sequence 
Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence 
Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 
1991 ; and Carillo, H., and Lipman, D., SI AM J. Applied Math., 48: 1073 (1988). 
Preferred methods to determine identity are designed to give the largest match 
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between the sequences tested. Methods to determine identity are codified in publicly 
available computer programs. Preferred computer program methods to determine 
identity between two sequences include, but are not limited to, the GCG program 
package (Devereux, J., et a/ 1984), BLASTP, BLASTN, and FASTA (Altschul, S. F., 
5 et a/., 1990. The BLASTX program is publicly available from NCBI and other sources 
(BLAST Manual, Altschul, S., efa/., NCBI NLM NIH Bethesda, Md. 20894; Altschul, 
S., etal., 1990). The well known Smith-Waterman algorithm may also be used to 
determine identity. 

By way of example, a polynucleotide sequence of the present invention may 

10 be identical to the reference sequence of one of SEQ ID NO:1 through SEQ ID NO: 
105, that is be 100% identical, or it may include up to a certain integer number of 
nucleotide alterations as compared to the reference sequence. Such alterations are 
selected from the group consisting of at least one nucleotide deletion, substitution, 
including transition and transversion, or insertion, and wherein said alterations may 

15 occur at the 5' or 3' terminal positions of the reference nucleotide sequence or 

anywhere between those terminal positions, interspersed either individually among 
the nucleotides in the reference sequence or in one or more contiguous groups within 
the reference sequence. The number of nucleotide alterations is determined by 
multiplying the total number of nucleotides in one of the odd numbered sequences 

20 set forth in SEQ ID NO: 1 through SEQ ID NO: 1 05 by the numerical percent of the 
respective percent identity (divided by 100) and subtracting that product from said 
total number of nucleotides in one of the odd numbered sequences set forth in SEQ 
ID NO: 1 through SEQ ID NO: 105. 

For example, the alterations in an isolated Alloiococcus otitidis polynucleotide 

25 comprise a polynucleotide sequence that has at least 70% identity to the nucleic acid 
sequence of one of the odd numbered sequences set forth in SEQ ID NO: 1 through 
SEQ ID NO: 105; a degenerate variant thereof or a fragment thereof, wherein the 
polynucleotide sequence may include up to n n nucleic acid alterations over the entire 
polynucleotide region of the nucleic acid sequence of any on of the odd numbered 

30 sequences set forth in SEQ ID NO: 1 through SEQ ID NO: 105, wherein n n is the 
maximum number of alterations and is calculated by the formula: 

n n < Xn-^y), 
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in which x n is the total number of nucleic acids of one of SEQ ID NO:1 through SEQ 
ID NO:105 and y has a value of 0.70, wherein any non-integer product of x n and y is 
rounded down to the nearest integer prior to subtracting such product from x„. Of 
course, y may also have a value of 0.80 for 80%, 0.85 for 85%, 0.90 for 90% 0.95 for 
5 95%, etc. 

Similarly, a polypeptide sequence of the present invention may be identical to 
the reference sequence of any one of even numbered sequences set forth in SEQ ID 
NO: 2 through SEQ ID NO: 1 06, that is 1 00% identical, or it may include up to a 
certain integer number of amino acid alterations as compared to the reference 

10 sequence such that the percentage identity is less than 1 00%. Such alterations are 
selected from the group consisting of at least one amino acid deletion, substitution, 
including conservative and non-conservative substitution, or insertion, and wherein 
said alterations may occur at the amino- or carboxy-terminal positions of the 
reference polypeptide sequence or anywhere between those terminal positions, 

15 interspersed either individually among the amino acids in the reference sequence or 
in one or more contiguous groups within the reference sequence. The number of 
amino acid alterations for a given % identity is determined by multiplying the total 
number of amino acids in one of the even numbered sequences set forth in SEQ ID 
NO: 2 through SEQ ID NO: 106 by the numerical percent of the respective percent 

20 identity (divided by 100) and then subtracting that product from said total number of 
amino acids in one of the even numbered sequences set forth in SEQ ID NO: 2 
through SEQ ID NO: 1 06, or: 

n a < Xrf-(Xa-y), 

wherein n a is the number of amino acid alterations, x a is the total number of amino 
25 acids in one of SEQ ID NO: 2 through SEQ ID NO: 106, and y is, for instance 0.70 for 
70%, 0.80 for 80%, 0.85 for 85% etc., and wherein any non-integer product of 
x.sub.a and y is rounded down to the nearest integer prior to subtracting it from x a . 

E. Vectors, Host Cells and Recombinant Alloiococcus ormots 
30 Polypeptides 

In a preferred embodiment, the present invention provides expression vectors 
comprising ORF polynucleotides that encode Alloiococcus otitidis polypeptides. 
Preferably, the expression vectors of the present invention comprise ORF 
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polynucleotides that encode Alloiococcus otitidis polypeptides comprising the amino 
acid residue sequence of one of the even numbered sequences set forth in SEQ ID 
NO: 2 through SEQ ID NO: 106. More preferably, the expression vectors of the 
present invention comprise a polynucleotide comprising the nucleotide base 

5 sequence of the odd numbered sequences set forth in SEQ ID NO: 1 through SEQ 
ID NO: 105. Even more preferably, the expression vectors of the invention comprise 
a polynucleotide operatively linked to promoter. Still more preferably, the expression 
vectors of the invention comprise a polynucleotide operatively linked to a prokaryotic 
promoter. Alternatively, the expression vectors of the present invention comprise a 

10 polynucleotide operatively linked to an enhancer-promoter, that is, an eukaryotic 
promoter. The expression vectors further comprise a polyadenylation signal that is 
positioned 3' of the carboxy-terminal amino acid and within a transcriptional unit of 
the encoded polypeptide. 

Expression of proteins in prokaryotes is most often carried out in Escherichia 

15 coli with vectors containing constitutive or inducible promoters directing the 

expression of either fusion or non-fusion proteins. Fusion, vectors add a number of 
amino acids to a protein encoded therein, usually to the amino terminus of the 
recombinant protein. Such fusion vectors typically serve three purposes: 1 ) to 
increase expression of recombinant protein; 2) to increase the solubility of the 

20 recombinant protein; and 3) to aid in the purification of the recombinant protein by 
acting as a ligand in affinity purification. Often, in fusion expression vectors, a 
proteolytic cleavage site is introduced at the junction of the fusion moiety and the 
recombinant protein to enable separation of the recombinant protein from the fusion 
moiety subsequent to purification of the fusion protein. Such enzymes, and their 

25 cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 

Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; 
Smith and Johnson, 1988), pMAL (New England Biolabs, Beverly; MA) and pRIT5 
(Pharmacia, Piscataway, NJ) which fuse glutathione S- transferase (GST), maltose E 
binding protein, or protein A, respectively, to the target recombinant protein. 

30 In one embodiment, the coding sequence of the Alloiococcus otitidis 

polynucleotide is cloned into a pGEX expression vector to create a vector encoding a 
fusion protein comprising, from the N-terminus to the C-terminus, GST-thrombin 
cleavage site- Alloiococcus otitidis polypeptide. The fusion protein can be purified by 
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affinity chromatography using glutathione-agarose resin. Recombinant Alloiococcus 
otitidis polypeptide unf used to GST can be recovered by cleavage of the fusion 
protein with thrombin. 

Examples of suitable inducible non-fusion Escherichia coii expression vectors 

5 include pTrc (Amann et a/., 1 988) and pET 1 1 d (Studier et a/., 1 990). Target gene 
expression from the pTrc vector relies on host RNA polymerase transcription from a 
hybrid trp-iac fusion promoter. Target gene expression from the pET I I d vector 
relies on transcription from a T7 gn1 0-lac fusion promoter mediated by a 
coexpressed viral RNA polymerase T7 gnl. This viral polymerase is supplied by host 

10 strains BL21 (DE3) or HMS I 74(DE3) from a resident prophage harboring a T7 gn1 
gene under the transcriptional control of the iacUV 5 promoter. 

One strategy to maximize recombinant protein expression in Escherichia cofi 
is to express the protein in a host bacterium with an impaired capacity to 
proteolytically cleave the recombinant protein. Another strategy is to aiter the nucleic 

15 acid sequence of the nucleic acid to be inserted into an expression vector so that the 
individual codons for each amino acid are those preferentially utilized in Escherichia 
coii. Such alteration of nucleic acid sequences of the invention can be carried out by 
standard DNA mutagenesis or synthesis techniques. 

In another embodiment, the Alioiococcus otitidis polynucleotide expression 

20 vector is a yeast expression vector. Examples of vectors for expression in a yeast 
such as S. cerevisiae include pYepSec I (Baldari, et al. t 1987), pMFa (Kurjan and 
Herskowitz, 1982), pJRY88 (Schultz et a/., 1987), and pYES2 (Invitrogen 
Corporation, San Diego, CA). 

Alternatively, an Ailoiococcus otitidis polynucleotide is expressed in insect 

25 cells using, for example, baculovirus expression vectors. Baculovirus vectors 

available for expression of proteins in cultured insect cells (e.g., Sf 9 or Sf 21 cells) 
include the pAc series (Smith et a/., 1 983) and the pVL series (Lucklow and 
Summers, 1989). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
30 mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed, 1987) and pMT2PC (Kaufman et a/., 
1987). When used in mammalian cells, the expression vector's control functions are 
often provided by viral regulatory elements. 
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As used herein, a promoter is a region of a DNA molecule typically within 
about 100 nucleotide pairs in front of (upstream of) the point at which transcription 
begins (/.e., a transcription start site). That region typically contains several types of 
DNA sequence elements that are located in similar relative positions in different 

5 genes. As used herein, the term "promoter 0 includes what is referred to in the art as 
an upstream promoter region, a promoter region or a promoter of a generalized 
eukaryotic RNA Polymerase II transcription unit. 

Another type of discrete transcription regulatory sequence element is an 
enhancer. An enhancer provides specificity of time, location and expression level for 

10 a particular encoding region (e.g., gene). A major function of an enhancer is to 

increase the level of transcription of a coding sequence in a cell that contains one or 
more transcription factors that bind to that enhancer. Unlike a promoter, an enhancer 
can function when located at variable distances from transcription start sites so long 
as a promoter is present. 

15 As used herein, the phrase "enhancer-promoter" means a composite unit that 

contains both enhancer and promoter elements. An enhancer-promoter is 
operatively linked to a coding sequence that encodes at least one gene product. As 
used herein, the phrase "operatively linked" means that an enhancer-promoter is 
connected to a coding sequence in such a way that the transcription of that coding 

20 sequence is controlled and regulated by that enhancer-promoter. Means for 

operatively linking an enhancer-promoter to a coding sequence are well known in the 
art. As is also well known in the art, the precise orientation and location relative to a 
coding sequence whose transcription is controlled, is dependent inter alia upon the 
specific nature of the enhancer-promoter. Thus, a TATA box minimal promoter is 

25 typically located from about 25 to about 30 base pairs upstream of a transcription 
initiation site and an upstream promoter element is typically located from about 100 
to about 200 base pairs upstream of a transcription initiation site, in contrast, an 
enhancer can be located downstream from the initiation site and can be at a 
considerable distance from that site. 

30 An enhancer-promoter used in a vector construct of the present invention can 

be any enhancer-promoter that drives expression in a cell to be transfected. By 
employing an enhancer-promoter with well-known properties, the level and pattern of 
gene product expression can be optimized. 
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For example, commonly used promoters are derived from polyoma, 
Adenovirus 2, cytomegalovirus (CMV) and Simian Virus 40 (SV40). For other 
suitable expression systems for both prokaryotic and eukaryotic cells see chapters 
16 and 17 of Sambrook et a/., "Molecular Cloning: A Laboratory Manual" 2nd, ed, 

5 Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1989, incorporated herein by reference. 

In another embodiment, the recombinant mammalian expression vector is 
capable of directing expression of the nucleic acid preferentially in a particular cell 
type (e.g., tissue-specific regulatory elements are used to express the nucleic acid). 

10 Tissue- specific regulatory elements are known in the art. Non-limiting examples of 
suitable tissue-specific promoters include the albumin promoter (liver-specific; Pinkert 
ef a/., 1987), lymphoid-specific promoters (Calame and Eaton, 1988), in particular 
promoters of T cell receptors (Winoto and Baltimore, 1 989) and immunoglobulins 
(Banerji et al., 1 983), Queen and Baltimore (1 983), neuron-specific promoters (e.g., 

15 the neurofilament promoter; Byrne and Ruddle, 1 989), pancreas-specific promoters 
(Edlund eta!., 1985), and mammary gland-specific promoters (e.g., milk whey 
promoter; U.S. 4, 873,316 and EP 264,166). Developmentally-regulated promoters 
are also encompassed, for example the murine hox promoters (Kessel and Gruss, 
1990) and the ct-fetoprotein promoter (Campes and Tilghman, 1989). 

20 The invention further provides a recombinant expression vector comprising a 

DNA molecule encoding an Alloiococcus otitidis polypeptide cloned into the 
expression vector in an antisense orientation. That is, the DNA molecule is 
operatively linked to a regulatory sequence in a manner which allows for expression 
(by transcription of the DNA molecule) of an RNA molecule which is antisense to 

25 Alloiococcus otitidis mRNA. Regulatory sequences operatively linked to a nucleic 
acid cloned in the antisense orientation can be chosen which direct the continuous 
expression of the antisense RNA molecule in a variety of cell types, for instance viral 
promoters and/or enhancers, or regulatory sequences can be chosen which direct 
constitutive, tissue specific or cell type specific expression of antisense RNA. The 

30 antisense expression vector can be in the form of a recombinant plasmid, phagemid 
or attenuated virus in which antisense nucleic acids are produced under the control 
of a high efficiency regulatory region, the activity of which can be determined by the 
cell type into which the vector is introduced. 
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Another aspect of the invention pertains to host cells into which a 
recombinant expression vector of the invention has been introduced. The terms 
"host cell" and "recombinant host cell" are used interchangeably herein. It is 
understood that such terms refer not only to the particular subject cell but also to the 

5 progeny or potential progeny of such a cell. Because certain modifications may 
occur in succeeding generations due to either mutation or environmental influences, 
such progeny may not, in fact, be identical to the parent cell, but are still included 
within the scope of the term as used herein. A host cell can be any prokaryotic or 
eukaryotic cell. For example, an Alloiococcus otitidis polypeptide can be expressed 

10 in bacterial cells such as Escherichia coli, insect cells, yeast or mammalian cells 

(such as Chinese hamster ovary cells (CHO), NIH3T3, PER C6, NSO, VERO or COS 
cells). Other suitable host cells are known to those skilled in the art. 

Vector DNA is can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation, infection or transfection techniques. As used herein, the 

15 terms "transformation" and "transfection" are intended to refer to a variety of art- 
recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, 
including calcium phosphate or calcium chloride co-precipitation, DEAE-dextran- 
mediated transfection, lipofection, protoplast fusion, direct microinfection. Another 
recognized technique for introducing DNA into a host cell is "infection", such as by 

20 adenovirus infection or electroporation. Suitable methods for transforming, infecting 
or transfecting host cells can be found in Sambrook, ef ai ("Molecular Cloning: A 
Laboratory Manual" 2nd ed, Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989), and other laboratory manuals. 
The most widely used method is transfection mediated by either calcium 

25 phosphate or DEAE-dextran. Although the mechanism remains unclear, it is 

believed that the transfected DNA enters the cytoplasm of the cell by endocytosis 
and is transported to the nucleus. Depending on the cell type, up to 90% of a 
population of cultured cells can be transfected at any one time. Because of its high 
efficiency, transfection mediated by calcium phosphate or DEAE-dextran is the 

30 method of choice for experiments that require transient expression of the foreign 
DNA in large numbers of cells. Calcium phosphate-mediated transfection is also 
used to establish cell lines that integrate copies of the foreign DNA, which are usually 
arranged in head-to-tail tandem arrays into the host cell genome. 
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In the protoplast fusion method, protoplasts derived from bacteria carrying 
high numbers of copies of plasmid of interest are mixed directly with cultured 
mammalian cells. After fusion of the cell membranes (usually with polyethylene 
glycol), the contents of the bacteria are delivered into the cytoplasm of the 

5 mammalian cells and the plasmid DNA is transported to the nucleus. Protoplast 
fusion is not as efficient as transfection for many of the cell lines that are commonly 
used for transient expression assays, but "rt is useful for cell lines in which 
endocytosis of DNA occurs inefficiently. Protoplast fusion frequently yields multiple 
copies of the plasmid DNA tandemly integrated into the host chromosome. 

10 The application of brief, high-voltage electric pulses (electroporation) to a 

variety of mammalian and plant cells leads to the formation of nanometer-sized pores 
in the plasma membrane. DNA is taken directly into the cell cytoplasm either through 
these pores or as a consequence of the redistribution of membrane components that 
accompanies closure of the pores. Electroporation can be extremely efficient and 

15 can be used both for transient expression of cloned genes and for establishment of 
cell lines that carry integrated copies of the gene of interest. Electroporation, in 
contrast to calcium phosphate-mediated transfection and protoplast fusion, frequently 
gives rise to cell lines that carry one, or at most a few, integrated copies of the 
foreign DNA. 

20 Liposome transfection involves encapsulation of DNA and RNA within 

liposomes, followed by fusion of the liposomes with the cell membrane. The 
mechanism of how DNA is delivered into the cell is unclear, but transfection 
efficiencies can be as high as 90%. 

Direct microinjection of a DNA molecule into nuclei has the advantage of not 
25 exposing DNA to cellular compartments such as low-pH endosomes. Microinjection 
therefore used primarily as a method to establish lines of cells that carry integrated 
copies of the DNA of interest. 

The use of adenovirus as a vector for cell transfection is well known in the art. 
Adenovirus vector-mediated cell transfection has been reported for various cells 
30 (Stratford-Perricaudet, etal. 1992). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, is used to produce (i.e., express) an AHoiococcus otitidis polypeptide. 
Accordingly, the invention further provides methods for producing an AHoiococcus 
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otitidis polypeptide using the host cells of the invention. In one embodiment, the 
method comprises culturing the host cell of invention (into which a recombinant 
expression vector encoding an Alloiococcus otitidis polypeptide has been introduced) 
in a suitable medium until the Alloiococcus otitidis polypeptide is produced. In 
5 another embodiment, the method further comprises isolating the Alloiococcus otitidis 
polypeptide from the medium or the host cell. 

A coding sequence of an expression vector is operatively linked to a 
transcription-terminating region. RNA polymerase transcribes an encoding DNA 
sequence through a site where poiyadenylation occurs. Typically, DNA sequences 

10 located a few hundred base pairs downstream of the poiyadenylation site serve to 
terminate transcription. Those DNA sequences are referred to herein as 
transcription-termination regions. Those regions are required for efficient 
poiyadenylation of transcribed messenger RNA (mRNA). Transcription-terminating 
regions are well known in the art. A preferred transcription-terminating region used in 

15 an adenovirus vector construct of the present invention comprises a poiyadenylation 
signal of SV40 or the protamine gene. 

An expression vector comprises a polynucleotide that encodes an 
Alloiococcus otitidis polypeptide. Such a polypeptide is meant to include a sequence 
of nucleotide bases encoding an Alloiococcus otitidis polypeptide sufficient in length 

20 to distinguish the segment from a polynucleotide segment encoding a non- 

Alloiococcus otitidis polypeptide. A polypeptide of the invention can also encode 
biologically functional polypeptides or peptides which have variant amino acid 
sequences, such as with changes selected based on considerations such as the 
relative hydropathic score of the amino acids being exchanged. These variant 

25 sequences are those isolated from natural sources or induced in the sequences 
disclosed herein using a mutagenic procedure such as site-directed mutagenesis. 

Preferably, an expression vector of the present invention comprises a 
polynucleotide that encodes a polypeptide comprising the amino acid residue 
sequence of one of the even numbered sequences set forth in SEQ ID NO: 2 through 

30 SEQ ID NO:.4036 An expression vector can include an Alloiococcus otitidis 

polypeptide coding region itself of any of the Alloiococcus otitidis polypeptides noted 
above or it can contain coding regions bearing selected alterations or modifications in 
the basic coding region of such an Alloiococcus otitidis polypeptide. Alternatively, 
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such vectors or fragments can also encode larger polypeptides or polypeptides which 
nevertheless include the basic coding region. In any event, it should be appreciated 
that due to codon redundancy as well as biological functional equivalence, this 
aspect of the invention is not limited to the particular DNA molecules corresponding 
5 to the polypeptide sequences noted above. 

Exemplary vectors include the mammalian expression vectors of the pCMV 
family including pCMV6b and pCMV6c (Chiron Corp., Emeryville CA.)- In certain 
cases, and specifically in the case of these individual mammalian expression vectors, 
the resulting constructs can require co-transfection with a vector containing a 

10 selectable marker such as pSV2neo. Via co-transfection into a dihydrofolate 
reductase-deficient Chinese hamster ovary cell line, such as DG44, clones 
expressing Alloiococcus otitidis polypeptides by virtue of DNA incorporated into such 
expression vectors can be detected. 

A DNA molecule of the present invention can be incorporated into a vector by 

15 a number of techniques that are well known in the art. For instance, the vector 

pUC18 has been demonstrated to be of particular value in cloning and expression of 
genes. Likewise, the related vectors M13mp18 and M13mp19 can also be used in 
certain embodiments of the invention, in particular, in performing dideoxy 
sequencing. 

20 An expression vector of the present invention is useful both as a means for 

preparing quantities of the Alloiococcus otitidis polypeptide-encoding DNA itself, and 
as a means for preparing the encoded polypeptide and peptides. It is contemplated 
that where Alloiococcus otitidis polypeptides of the invention are made by 
recombinant means, one can employ either prokaryotic or eukaryotic expression 

25 vectors as shuttle systems. In another aspect, the recombinant host cells of the 

present invention are prokaryotic host cells. Preferably, the recombinant host cells of 
the invention are bacterial cells of the DH5a strain of Escherichia coli. In general, 
prokaryotes are preferred for the initial cloning of DNA sequences and constructing 
the vectors useful in the invention. For example, Escherichia coli K1 2 strains can be 

30 particularly useful. Other microbial strains that can be used include Escherichia coli 
B, Escherichia co//W3110 (ATCC No. 273325) and Escherichia, co/^1976 (ATCC 
No. 31 537). Bacilli such as Bacillus subtilis, or other enterobacteriaceae such as 
Salmonella typhimurium or other Salmonella species or Serratia marcesans, and 
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various pseuciomonas species can be used. These examples are, of course, 
intended to be illustrative rather than limiting. 

In general, plasmid vectors containing replicon and control sequences that 
are derived from species compatible with the host cell are used in connection with 
5 these hosts. The vector ordinarily carries a replication site, as well as marking 

sequences that are capable of providing phenotypic selection in transformed cells. 
For example, Escherichia coli can be transformed using pBR322, a plasmid derived 
from an Escherichia coll species (Bolivar, etai 1977). pBR322 contains genes for 
ampicillin and tetracycline resistance and thus, provides easy means for identifying 

10 transformed cells. The pBR322 plasmid, or other microbial plasmid or phage, must 
also contain, or be modified to contain, promoters which can be used by the microbial 
organism for expression of its own polypeptides. 

Those promoters most commonly used in recombinant DNA construction 
include the P-lactamase (penicillinase) and lactose promoter systems (Chang, et al. 

15 1978; Itakura., etai 1977, Goeddei, et al. 1979; Goeddel, etai. 1980) and a 

tryptophan (TRP) promoter system (EP 0036776; Siebwenlist etai. 1980). While 
these are the most commonly used, other microbial promoters have been discovered 
and utilized, and details concerning their nucleotide sequences have been published, 
enabling a skilled worker to introduce functional promoters into plasmid vectors 

20 (Siebwenlist, etai. 1980). 

In addition to prokaryotes, eukaryotic microbes such as yeast can also be 
used. Saccharomyces cerevisiase or common baker's yeast is the most commonly 
used among eukaryotic microorganisms, although a number of other strains are 
commonly available. For expression in Saccharomyces, the plasmid YRp7, for 

25 example, is commonly used (Stinchcomb, et al. 1979; Kingsman, etai 1979; 

Tschemper, etai 1980). This plasmid already contains the trpl gene that provides a 
selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, 
for example ATCC No. 44076 or PEP4-1 (Jones, 1 977). The presence of the trpl 
lesion as a characteristic of the yeast host cell genome then provides an effective 

30 environment for detecting transformation by growth in the absence of tryptophan. 

Suitable promoter sequences in yeast vectors include the promoters for 3- 
phosphoglycerate kinase (PGK) (Hitzeman, et al 1980) or other glycolytic enzymes 
(Hess, etai 1968; Holland, etai. 1978) such as enolase, glyceraldehyde-3- 
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phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, 
phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, 
pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and 
glucokinase. In constructing suitable expression plasmids, the termination sequences 
5 associated with these genes are also introduced into the expression vector 

downstream from the sequences to be expressed to provide polyadenylation of the 
mRNA and termination. Other promoters, which have the additional advantage of 
transcription controlled by growth conditions are the promoter region for alcohol 
dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes 

10 associated with nitrogen metabolism, and the aforementioned glyceraldehyde-3- 
phosphate dehydrogenase, and enzymes responsible for maltose and galactose 
utilization. Any plasmid vector containing a yeast-compatible promoter, origin of 
replication, and termination sequences is suitable. 

In addition to microorganisms, cultures of cells derived from multicellular 

15 organisms can also be used as hosts. In principle, any such cell culture is workable, 
whether from vertebrate or invertebrate culture. However, interest has been greatest 
in vertebrate cells, and propagation of vertebrate cells in culture (tissue culture) has 
become a routine procedure in recent years. Examples of such useful host cell lines 
are AtT-20, VERO, HeLa, NSO, PER C6, Chinese hamster ovary (CHO) cell lines, 

20 W138, BHK, COSM6, COS-7, 293 , VERO and MDCK cell lines. Expression vectors 
for such cells ordinarily include (if necessary) an origin of replication, a promoter 
located upstream of the gene to be expressed, along with any necessary ribosome 
binding sites, RNA splice sites, polyadenylation site, and transcriptional terminator 
sequences. 

25 Where expression of recombinant Alloiococcus otitidis polypeptides is desired 

and a eukaryotic host is contemplated, it is most desirable to employ a vector, such 
as a plasmid, that incorporates a eukaryotic origin of replication. Additionally, for the 
purposes of expression in eukaryotic systems, one desires to position the 
Alloiococcus otitidis encoding sequence adjacent to and under the control of an 

30 effective eukaryotic promoter such as promoters used in combination with Chinese 
hamster ovary cells (CHO). To bring a coding sequence under control of a promoter, 
whether it is eukaryotic or prokaryotic, what is generally needed is to position the 5' 
end of the translation initiation side of the proper translational reading frame of the 
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polypeptide between about 1 and about 50 nucleotides 3' of or downstream with 
respect to the promoter chosen. Furthermore, where eukaryotic expression is 
anticipated, one would typically desire to incorporate an appropriate polyadenylation 
site into the transcriptional unit that includes the Alloiococcus otitidis polypeptide. 
5 . a transfected cell can be prokaryotic or eukaryotic. Preferably, the host cells 

of the invention are prokaryotic host cells. Where it is of interest to produce an 
Alloiococcus otitidis polypeptide, cultured prokaryotic host cells are of particular 
interest. 

In yet another embodiment, the present invention contemplates a process or 
10 method of preparing Ailoiococcus otitidis polypeptides comprising transfecting, 
transforming or infecting cells with a polynucleotide that encodes an Alloiococcus 
otitidis polypeptide to produce transformed host cells; and maintaining the 
transformed host cells under biological conditions sufficient for expression of the 
polypeptide. Preferably, the transformed host cells are prokaryotic cells. 
15 Alternatively, the host cells are eukaryotic cells. More preferably, the prokaryotic 
cells are bacterial cells of the DH5a strain of Escherichia coil Even more preferably, 
the polynucleotide transfected into the transformed cells comprises the nucleic acid 
sequence of one of the odd numbered sequences set forth in SEQ ID NO: 1 through 
SEQ ID NO: 105. Additionally, transfection is accomplished using an expression 
20 vector disclosed above. A host cell used in the process is capable of expressing a 
functional, recombinant Alloiococcus otitidis polypeptide. 

Following transfection, the cell is maintained under culture conditions for a 
period of time sufficient for expression of an Alloiococcus otitidis polypeptide. Culture 
conditions are well known in the art and include ionic composition and concentration, 
25 temperature, pH and the like. Typically, transfected cells are maintained under 

culture conditions in a culture medium. Suitable media for various cell types are well 
known in the art. In a preferred embodiment, temperature is from about 20°C to 
about 50°C, more preferably from about 30°C to about 40°C and, even more 

preferably about 37°C. 
30 The pH is preferably from about a value of 6.0 to a value of about 8.0, more 

preferably from about a value of about 6.8 to a value of about 7.8 and, most 
preferably about 7.4. Osmolality is preferably from about 200 miiliosmols per liter 
(mosm/L) to about 400 mosm/l and, more preferably from about 290 mosm/L to 
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about 31 0 mosm/L. Other biological conditions needed for transfection and 

expression of an encoded protein are well known in the art. 

Transfected cells are maintained for a period of time sufficient for expression 

of an Alloiococcus otitidis polypeptide. A suitable time depends inter alia upon the 
5 cell type used and is readily determinable by a skilled artisan. Typically, 

maintenance time is from about 2 to about 1 4 days. 

Recombinant Alioiococcus otitidis polypeptide is recovered or collected either 

from the transfected cells or the medium in which those cells are cultured. Recovery 

comprises isolating and purifying the Alloiococcus otitidis polypeptide. Isolation and 
10 purification techniques for polypeptides are well known in the art and include such 

procedures as precipitation, filtration, chromatography, electrophoresis and the like. 

F. ANTIBODIES IMMUNOREACTIVE WITH ALLOIOCOCCUS OTITIDIS POLYPEPTIDES 

15 In still another embodiment, the present invention provides antibodies 

immunoreactive with Alloiococcus otitidis polypeptides. Preferably, the antibodies of 
the invention are monoclonal antibodies. Additionally, the Alloiococcus otitidis 
polypeptides comprise the amino acid residue sequence of one of the even 
numbered sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 106. Means 

20 for preparing and characterizing antibodies are well known in the art (See, e.g., 
Antibodies "A Laboratory Manual", E. Howell and D. Lane, Cold Spring Harbor 
Laboratory, 1988). Polyclonal antisera is obtained by bleeding an immunized animal 
into a glass or plastic container, incubating the blood at 25°C for one hour, followed 
by incubating at 4°C for 2-18 hours. The serum is then recovered by centrifugation. 

25 Briefly, a polyclonal antibody is prepared by immunizing an animal with an 

immunogen comprising a polypeptide or polynucleotide of the present invention, and 
collecting antisera from that immunized animal. A wide range of animal species can 
be used for the production of antisera. Typically an animal used for production of 
anti-antisera is a rabbit, a mouse, a rat, a hamster or a guinea pig. Because of the 

30 relatively large blood volume of rabbits, a rabbit is a preferred choice for production 
of polyclonal antibodies. 

As is well known in the art, a given polypeptide or polynucleotide may vary in 
its immunogenicity. It is often necessary therefore to couple the immunogen (e.g., a 
polypeptide or polynucleotide) of the present invention with a carrier. Exemplary and 
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preferred carriers are keyhole limpet hemocyanin (KLH) and bovine serum albumin 
(BSA). Other albumins such as ovalbumin, mouse serum albumin or rabbit serum 
albumin can also be used as carriers. 

Means for conjugating a polypeptide or a polynucleotide to a carrier protein 

5 are well known in the art and include glutaraldehyde, m-maleimidobencoyl-N- 
hydroxysuccinimide ester, carbodiimide and bis-biazotized benzidine. 

As is also well known in the art, immunogencity to a particular immunogen 
can be enhanced by the use of non-specific stimulators of the immune response 
known as adjuvants. Exemplary and preferred adjuvants include complete Freund's 

10 adjuvant, incomplete Freund's adjuvants, cholera toxin (e.g. mutant cholera toxin 
E29H; see published International Patent Application WO 00/18434), and aluminum 
hydroxide adjuvant. 

The amount of immunogen used for the production of polyclonal antibodies 
depends upon the nature of the immunogen as well as the animal used for 

15 immunization. A variety of routes can be used to administer the immunogen 

(subcutaneous, intramuscular, intradermal, intravenous and intraperitoneal). The 
production of polyclonal antibodies is monitored by sampling blood from the 
immunized animal at various points following immunization. When a desired level of 
immunogenicity is obtained, the immunized animal can be bled and the serum 

20 isolated and stored. 

In another aspect, the present invention contemplates a process of producing 
an antibody immunoreactive with an Alloiococcus otitidis polypeptide comprising the 
steps of (a) transfecting recombinant host cells with a polynucleotide that encodes an 
Alloiococcus otitidis polypeptide; (b) culturing the host cells under conditions 

25 sufficient for expression of the polypeptide; (c) recovering the polypeptides; and (d) 
preparing the antibodies to the polypeptides. Preferably, the host cell is transfected 
with the polynucleotide of one of the odd numbered sequences set forth in SEQ ID 
NO: 1 through SEQ ID NO: 4035. Even more preferably, the present invention 
provides antibodies prepared according to the process described above. 

30 A monoclonal antibody of the present invention can be readily prepared 

through use of weil-known techniques such as those exemplified in U.S. Pat. No. 
4,196,265, herein incorporated by reference. Typically, a technique involves first 
immunizing a suitable animal with a selected antigen (e.g., a polypeptide or 
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polynucleotide of the present invention) in a manner sufficient to provide an immune 
response. Rodents such as mice and rats are preferred animals. Spleen cells from 
the immunized animal are then fused with cells of an immortal myeloma cell. Where 
the immunized animal is a mouse, a preferred myeloma cell is a murine NS-1 
5 myeloma cell. 

The fused spleen/myeloma cells are cultured in a selective medium to select 
fused spleen/myeloma cells from the parental cells. Fused cells are separated from 
the mixture of non-fused parental cells, e.g., by the addition of agents that block the 
de novo synthesis of nucleotides in the tissue culture media. Exemplary and 
10 preferred agents are aminopterin, methotrexate, and azaserine. Aminopterin and 
methotrexate block de novo synthesis of both purines and pyrimidines, whereas 
azaserine blocks only purine synthesis. Where aminopterin or methotrexate is used, 
the media is supplemented with hypoxanthine and thymidine as a source of 
nucleotides. Where azaserine is used, the media is supplemented with 

15 hypoxanthine. 

This culturing provides a population of hybridomas from which specific 
hybridomas are selected. Typically, selection of hybridomas is performed by 
culturing the cells by single-clone dilution in microtiter plates, followed by testing the 
individual clonal supernatants for reactivity with an antigen-polypeptide. The 

20 selected clones can then be propagated indefinitely to provide the monoclonal 
antibody. 

By way of specific example, to produce an antibody of the present invention, 
mice are injected intraperitoneally with between about 1-200 y.g of an antigen 
comprising a polypeptide of the present invention. B lymphocyte cells are stimulated 

25 to grow by injecting the antigen in association with an adjuvant such as complete 

Freund's adjuvant (CFA; a non-specific stimulator of the immune response containing 
killed Mycobacterium tuberculosis). At some time (e.g., at least two weeks) after the 
first injection, mice are boosted by injection with a second dose of the antigen mixed 
with incomplete Freund's adjuvant (IFA; lacks the killed mycobacterium of CFA). 

30 A few weeks after the second injection, mice are tail bled and the sera titered 

by immunoprecipitation against radiolabeled antigen. Preferably, the process of 
boosting and titering is repeated until a suitable titer is achieved. The spleen of the 
mouse with the highest titer is removed and the spleen lymphocytes are obtained by 
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homogenizing the spleen with a syringe. Typically, a spleen from an immunized 
mouse contains approximately 5x1 0 7 to 2x1 0 8 lymphocytes. 

Mutant lymphocyte cells known as myeloma cells are obtained from 
laboratory animals in which such cells have been induced to grow by a variety of 

5 well-known methods. Myeloma ceils lack the salvage pathway of nucleotide 
biosynthesis. Because myeloma cells are tumor ceils, they can be propagated 
indefinitely in tissue culture, and are thus denominated immortal. Numerous cultured 
cell lines of myeloma cells from mice and rats, such as murine NS-1 myeloma cells, 
have been established. 

10 Myeloma cells are combined under conditions appropriate to foster fusion 

with the normal antibody-producing cells from the spleen of the mouse or rat injected 
with the antigen/polypeptide of the present invention. Fusion conditions include, for 
example, the presence of polyethylene glycol. The resulting fused cells are 
hybridoma cells. Like myeloma cells, hybridoma cells grow indefinitely in culture. 

15 Hybridoma cells are separated from unfused myeloma cells by culturing in a 

selection medium such as HAT media (hypoxanthine, aminopterin, thymidine). 
Unfused myeloma cells lack the enzymes necessary to synthesize nucleotides from 
the salvage pathway because they are killed in the presence of aminopterin, 
methotrexate, or azaserine. Unfused lymphocytes also do not continue to grow in 

20 tissue culture. Thus, only ceils that have successfully fused (hybridoma cells) can 
grow in the selection media. 

Each of the surviving hybridoma cells produces a single antibody. These 
cells are then screened for the production of the specific antibody immunoreactive 
with an antigen/polypeptide of the present invention. Single cell hybridomas are 

25 isolated by limiting dilutions of the hybridomas. The hybridomas are serially diluted 
many times and, after the dilutions are allowed to grow, the supernatant is tested for 
the presence of the monoclonal antibody. The clones producing that antibody are 
then cultured in large amounts to produce an antibody of the present invention in 
convenient quantity. 

30 By use of a monoclonal antibody of the present invention, specific - 

polypeptides and polynucleotide of the invention are identified as antigens. Once 
identified, those polypeptides and polynucleotide are isolated and purified by 
techniques such as antibody-affinity chromatography. In antibody-affinity 
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chromatography, a monoclonal antibody is bound to a solid substrate and exposed to 
a solution containing the desired antigen. The antigen is removed from the solution 
through an immunospecific reaction with the bound antibody. The polypeptide or 
polynucleotide is then easily removed from the substrate and purified. 

5 Additionally, examples of methods and reagents particularly amenable for use 

in generating and screening antibody display library can be found in, for example, 
U.S. 5,223,409; WO 92/18619; WO 91/17271; WO 92/20791; WO 92/1 5679; WO 
93/01288; WO 92/01047; WO 92/09690; WO 90/02809, which are incorporated 
herein in their entirety by reference. 

10 Additionally, recombinant ant\-Alloiococcus otitidis antibodies, such as 

chimeric and humanized monoclonal antibodies, comprising both human and non- 
human fragments, which are made using standard recombinant DNA techniques, are 
within the scope of the invention. Such chimeric and humanized monoclonal 
antibodies are produced by recombinant DNA techniques known in the art, for 

15 example using methods described in PCT/US86/02269; EP 184,187; EP 171 ,496; 
EP 173,494; WO 86/01533; U.S. 4,816,567; and EP 125,023. 

An ant\-Alloiococcus otitidis antibody (e.g., monoclonal antibody) is used to 
isolate Alloiococcus otitidis polypeptides by standard techniques, such as affinity 
chromatography or immunoprecipitation. An anW-Alloiococcus otitidis antibody 

20 facilitates the purification of a natural Afioiococcus otitidis polypeptide from cells and • 
recombinantly produced Alioiococcus otitidis polypeptides expressed in host cells. 
Moreover, an anti->A//o/ococcL/s otitidis antibody is used to detect Alloiococcus otitidis 
polypeptide (e.g., in a cellular lysate or cell supernatant) in order to evaluate the 
abundance of the Alloiococcus otitidis polypeptide. The detection of circulating 

25 fragments of an Alloiococcus otitidis polypeptide is used to identify Alloiococcus 
otitidis polypeptide turnover in a subject. Anti-/4//o/ococcus otitidis antibodies are 
used diagnostically to monitor protein levels in tissue as part of a clinical testing 
procedure, e.g., to, for example, determine the efficacy of a given treatment regimen. 
Detection is facilitated by coupling (/.e., physically linking) the antibody to a 

30 detectable substance. Examples of detectable substances include various enzymes, 
prosthetic groups, fluorescent materials, luminescent materials, bioluminescent 
materials, and radioactive materials. Examples of suitable enzymes include 
horseradish peroxidase, alkaline phosphatase, P-galactosidase, or 

* 
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acetylcholinesterase; examples of suitable prosthetic group complexes include 
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials 
include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylarnine fluorescein, dansyl chloride or phycoerythrin; an example of a 
5 luminescent material includes iuminol; examples of bioluminescent materials include 
luciferase, lucrferin, and acquorin, and examples of suitable radioactive material 
include 12 V 31 l, 15 Sor 3 H. 

G. Pharmaceutical Compositions 

10 . . 

In certain embodiments, the present invention provides pharmaceutical 

compositions comprising compounds that inhibit the activities of Alloiococcus otitidis 

polypeptides, and physiologically acceptable carriers. Compounds that inhibit the 

activities of Alloiococcus otitidis polypeptides polypeptides, which are essential for 

15 the proliferation of the bacteria, are identified using one or more assay systems set 
forth in Examples 5-38. More preferably, the pharmaceutical compositions comprise 
one or more compounds that inhibit the activities of Alloiococcus otitidis polypeptides 
comprising the amino acid residue sequence of one or more of the even numbered 
sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 106. In other 

20 embodiments, the pharmaceutical compositions of the invention comprise antisense 
polynucleotides of polynucleotides selected from one of the odd numbered 
sequences set forth in Seq. ID NO. 1 to Seq. ID No. 105, and physiologically 

acceptable carriers. 

Various tests are to be used to assess the in vitro and in vivo efficacy of 

25 anitmicrbbial and pharmaceutical compounds that inhibit the activities of Alloiococcus 
otitidis polypeptides, and these are set forth in detail in Examples 5 through 38. For 
example, an in vitro activity of the compounds may be assayed by incubating 
together a mixture of Alloiococcus otitidis or other heterologous bacterial ceils such 
as E. cofi cells expressing Alloiococcus otitidis polypeptides set forth in one of the 

30 even numbered sequences from Seq. ID No. 2 to Seq. ID No. 106, and then 

measuring the activity of the polypeptide using one or more of the assay systems 
detailed in Example 5 through 38. 

The Alloiococcus otitidis polynucleotides, polypeptides, compounds that 
modulate the activity of an Alloiococcus otitidis polypeptides, and anti-/\//o/ococcus 
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otitidis antibodies (also referred to herein as "active compounds") of the invention can 
be incorporated into pharmaceutical compositions suitable for administration to a 
host or subject, e.g., a human. Such compositions typically comprise the nucleic acid 
molecule, protein, antimicrobial compound, or antibody and a pharmaceutical^ 
5 acceptable carrier. As used herein the language "pharmaceutical^ acceptable 
carrier M is intended to include any and all solvents, dispersion media, coatings, 
antibacterial and antifungal agents, isotonic and absorption delaying agents, and the 
like, compatible with pharmaceutical administration. The use of such media and 
agents for pharmaceutical^ active substances is well known in the art. Except 
10 insofar as any conventional media or agent is incompatible with the active 
compound, such media can be used in the compositions of the invention. 
Supplementary active compounds can also be incorporated into the compositions. 

A pharmaceutical of the invention is formulated to be compatible with its 
intended route of administration. Examples of routes of administration include 
15 parenteral, (e.g., intravenous, intradermal, subcutaneous, intraperitoneal), 

transmucosal {e.g., oral, rectal, intranasal, vaginal, respiratory), and transdermal 
(topical). Solutions or suspensions used for parenteral, intradermal, or subcutaneous 
application can include the following components: a sterile diluent such as water for 
injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol 
20 or other synthetic solvents; antibacterial agents such as benzyl alcohol or methyl 
parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents 
such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or 
phosphates and agents for the adjustment of tonicity such as sodium chloride or 
dextrose. pH can be adjusted with acids or bases, such as hydrochloric acid or 
25 sodium hydroxide. The parenteral preparation can be enclosed in ampoules, 
disposable syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile 
aqueous solutions (where water-soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For 
30 intravenous administration, suitable carriers include physiological saline, 

bacteriostatic water, Cremophor EL™(BASF, Parsippany, NJ) or phosphate buffered 
saline (PBS). In ail cases, the composition must be sterile and should be fluid to the 
extent that easy syringability exists. It must be stable under the conditions of 
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can be included as part of the composition. The tablets, pills, capsules, troches and 
the like can contain any of the following ingredients, or compounds of a similar 
nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an 
excipient such as starch or lactose, a disintegrating agent such as alginic acid, 
5 Primogel, or corn starch; a lubricant such as magnesium stearate or Sterotes; a 
glidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or 
saccharin; or a flavoring agent such as peppermint, methyl salicylate, or orange 
flavoring. 

For administration by inhalation, the compounds are delivered in the form of 
10 an aerosol spray from pressured container or dispenser that contains a suitable 
propeliant, e.g., a gas such as carbon dioxide, or a nebulizer. Systemic 
administration can also be by transmucosal or transdermal means. For transmucosal 
or transdermal administration, penetrants appropriate to the barrier to be permeated 
are used in the formulation. Such penetrants are generally known in the art, and 
15 include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through 
the use of nasal sprays or suppositories. For transdermal administration, the active 
compounds are formulated into ointments, salves, gels, or creams as generally 
known in the art. 

20 The compounds can also be prepared in the form of suppositories (e.g., with 

conventional suppository bases such as cocoa butter and other glycerides) or 
retention enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled 
25 release formulation, including implants and microencapsulated delivery systems. 

Biodegradable, biocompatible polymers can be used, such as ethylene vinyl 
acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic 
acid. Methods 

30 H. Diagnostic Assays 

The invention also provides methods for detecting the presence of an 
Alloiococcus otitidis polypeptide or Alloiococcus otitidis polynucleotide, or fragment 
thereof, in a biological sample. The method involves contacting the biological sample 
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with a compound or an agent capable of detecting an AHoiococcus otitidis 
polypeptide or mRNA such that the presence of the AHoiococcus otitidis 
polypeptide/encoding nucleic acid molecule is detected in the biological sample. A 
preferred agent for detecting AHoiococcus otitidis mRNA or DNA is a labeled or 

5 labelable oligonucleotide probe capable of hybridizing to AHoiococcus otitidis mRNA 
or DNA. The nucleic acid probe can be, for example, a full-length AHoiococcus 
otitidis polynucleotide of one of the odd numbered sequences set forth in SEQ ID 
NO: 1 through SEQ ID NO: 105, a complement thereof, or a fragment thereof, such 
as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and 

10 sufficient to specifically hybridize under stringent conditions to AHoiococcus otitidis 
mRNA or DNA. Alternatively, the sample can be contacted with an oligonucleotide 
primer of an AHoiococcus otitidis polynucleotide of SEQ ID NO: 1 through SEQ ID 
:105, a complement thereof, or a fragment thereof, in the presence of nucleotides 
and a polymerase, under conditions permitting primer extension. 

15 A preferred agent for detecting AHoiococcus otitidis polypeptide is a labeled or 

labelable antibody capable of binding to an AHoiococcus otitidis polypeptide. 
Antibodies can be polyclonal, or more preferably, monoclonal. An intact antibody, or 
a fragment thereof (e.g., Fab or F(ab')2) can be used. The term "labeled Or 
labelable," with regard to the probe or antibody, is intended to encompass direct 

20 labeling of the probe or antibody by coupling (/.a, physically linking) a detectable 
substance to the probe or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is directly labeled. Examples of 
indirect labeling include detection of a primary antibody using a fluorescently labeled 
secondary antibody and end-labeling of a DNA probe with biotin such that it can be 

25 detected with fluorescently labeled streptavidin. The term "biological sample" is 

intended to include tissues, cells and biological fluids isolated from a subject, as well 
as tissues, cells and fluids present within a subject. That is, the detection method of 
the invention can be used to detect AHoiococcus otitidis mRNA, DNA or protein in a 
biological sample in vitro as well as in vivo. For example, in vitro techniques for 

30 detection of AHoiococcus otitidis mRNA include Northern hybridizations and in situ 
hybridizations, in vitro techniques for detection of AHoiococcus otitidis polypeptide 
include enzyme linked immunosorbent assays (ELISAs), Western-blots, 
immunoprecipitations and immunofluorescence. Alternatively, AHoiococcus otitidis 
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polypeptides can be detected in vivo in a subject by introducing into the subject a 
labeled antl-Alloiococcus otitidis antibody. For example, the antibody can be labeled 
with a radioactive marker whose presence and location in a subject can be detected 
by standard imaging techniques. 

5 The polynucleotides according to the invention may also be used in analytical 

DNA chips, which allow sequencing, the study of mutations and of the expression of 
genes, and which are currently of interest given their very small size and their high 
capacity in terms of number of analyses. 

The principle of the operation of these chips is based on molecular probes, 

10 most often oligonucleotides, which are attached onto a miniaturized surface, 

generally of the order of a few square centimeters. During an analysis, a sample 
containing fragments of a target nucleic acid to be analyzed, for example DNA or 
RNA labeled, for example, after amplification, is deposited onto the DNA chip in 
which the support has been coated beforehand with probes. Bringing the labeled 

15 target sequences into contact with the probes leads to the formation, through 

hybridization, of a duplex according to the rule of pairing defined by J.D. Watson and 
F. Crick. After a washing step, analysis of the surface of the chip allows the effective 
hybridizations to be located by means of the signals emitted by the labels tagging the 
target. A hybridization fingerprint results from this analysis which, by appropriate 

20 computer processing, will make it possible to determine information such as the 
presence of specific fragments in the sample, the determination of sequences and 
the presence of mutations. 

The chip consists of a multitude of molecular probes, precisely organized or 
arrayed on a solid support whose surface is miniaturized. It is at the center of a 

25 system where other elements (imaging system, microcomputer) allow the acquisition 
and interpretation of a hybridization fingerprint. 

The hybridization supports are provided in the form of flat or porous surfaces 
(pierced with wells) composed of various materials. The choice of a support is 
determined by its physicochemical properties, or more precisely, by the relationship 

30 between the latter and the conditions under which the support will be placed during 
the synthesis or the attachment of the probes or during the use of the chip. It is 
therefore necessary, before considering the use of a particular support, to consider 
characteristics such as its stability to pH, its physical strength, its reactivity and its 
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chemical stability as well as its capacity to nonspecifically bind nucleic acids. 
Materials such as glass, silicon and polymers are commonly used. Their surface is, 
in a first step, called "functionalization", made reactive towards the groups which it is 
desired to attach thereon. After the functionalization, so-called spacer molecules are 

5 grafted onto the activated surface. Used as intermediates between the surface and 
the probe, these molecules of variable size render unimportant the surface properties 
of the supports, which often prove to be problematic for the synthesis or the 
attachment of the probes and for the hybridization. 

Among the hybridization supports, there may be mentioned glass which is 

10 used, for example, in the method of in situ synthesis of oligonucleotides by 

photochemical addressing devejoped by the company Affymetrix (E.L. Sheldon, 
1 993), the glass surface being activated by silane. Genosensor Consortium 
(P. Merel, 1994) also uses glass slides carrying wells 3 mm apart, this support being 
activated with epoxysilane. 

15 The probes according to the invention may be synthesized directly in situ on 

the supports of the DNA chips. This in situ synthesis may be carried out by 
photochemical addressing (developed by the company Affymax (Amsterdam, 
Holland) and exploited industrially by its subsidiary Affymetrix (United States)) or 
based on the VLSIPS (very large scale immobilized polymer synthesis) technology 

20 (S.P.A. Fodor et a/., 1 991 ) which is based on a method of photochemically directed 
combinatory synthesis and the principle of which combines soiid-phase chemistry, 
the use of photolabile protecting groups and photolithography. 

The probes according to the invention may be attached to the DNA chips in 
various ways such as electrochemical addressing, automated addressing or the use 

25 of probe printers (T. Livache et a/., 1 994; G. Yershov et a/., 1 996; J. Derisi et a/., 
1996, and S. Borman, 1996). 

The revealing of the hybridization between the probes of the invention, 
deposited or synthesized in situ on the supports of the DNA chips, and the sample to 
be analyzed, may be determined, for example, by measurement of fluorescent 

30 signals, by radioactive counting or by electronic detection. 

The use of fluorescent molecules such as fluorescein constitutes the most 
common method of labeling the samples. It allows direct or indirect revealing of the 
hybridization and allows the use of various fluorochromes. 
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Affymetrix currently provides an apparatus or a scanner designed to read its 
Gene Chip™ chips. It makes it possible to detect the hybridizations by scanning the 
surface of the chip in confocal microscopy (R.J. Lipshutz et al, 1995). 

The nucleotide sequences according to the invention are also used in DNA 

5 chips to carry out the analysis of the expression of the Alloiococcus otitidis genes. 
This analysis of the expression of Alloiococcus otitidis genes is based on the use of 
chips where probes of the invention, chosen for their specificity to characterize a 
given gene, are present (D.J. Lockhart et a/., 1996; D.D. Shoemaker ef a/., 1996). 
For the methods of analysis of gene expression using the DNA chips, reference may, 

10 for example, be made to the methods described by D.J. Lockhart ef al. (1 996) and 
Sosnowsky et al. (1997) for the synthesis of probes in situ or for the addressing and 
the attachment of previously synthesized probes. The target sequences to be 
analyzed are labeled and in general fragmented into sequences of about 50 to 
100 nucleotides before being hybridized onto the chip. After washing as described, 

15 for example, by D.J. Lockhart et al. (1996) and application of different electric fields 
(Sosnowsky eta!., 1997), the labeled compounds are detected and quantified, the 
hybridizations being carried out at least in duplicate. Comparative analyses of the 
signal intensities obtained with respect to the same probe for different samples 
and/or for different probes with the same sample, determine the differential 

20 expression of RNA or of DNA derived from the sample. 

The nucleotide sequences according to the invention are, in addition, used in 
DNA chips where other nucleotide probes specific for other microorganisms are also 
present, and allow the carrying out of a serial test allowing rapid identification of the 
presence of a microorganism in a sample. 

25 Accordingly, the subject of the invention is also the nucleotide sequences 

according to the invention, characterized in that they are immobilized on a support of 
a DNA chip. 

The DNA chips, characterized in that they contain at least one nucleotide 
sequence according to the invention, immobilized on the support of the said chip, 
30 also form part of the invention. 

The chips preferably contain several probes or nucleotide sequences of the 
invention of different length and/or corresponding to different genes so as to identify, 
with greater certainty, the specificity of the target sequences or the desired mutation 
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in the sample to be analyzed. 

Accordingly, the analyses carried out by means of primers and/or probes 
according to the invention, immobilized on supports such as DNA chips, make it 
possible, for example, to identify, in samples, mutations linked to variations such as 
5 intraspecies variations. These variations may be correlated or associated with 
pathologies specific to the variant identified and make it possible to select the 
appropriate treatment. 

The invention thus comprises a DNA chip according to the invention, 
characterized in that it contains, in addition, at least one nucleotide sequence of a 
10 microorganism different from Alloiococcus otitidis, immobilized on the support of the 
said chip; preferably, the different microorganism is chosen from an associated 
microorganism, a bacterium of the Streptococcus family, and a variant of the species 
Alloiococcus otitidis. 

The principle of the DNA chip as explained above, is also used to produce 
15 protein "chips" on which the support has been coated with a polypeptide or an 

antibody according to the invention, or arrays thereof, in place of the DNA. These 
protein "chips" make it possible, for example, to analyze the biomoiecular interactions 
(BIA) induced by the affinity capture of target analytes onto a support coated, for 
example, with proteins, by surface plasma resonance (SPR). Reference may be 
20 made, for example, to the techniques for coupling proteins onto a solid support which 
are described in EP 524 800 or to the methods describing the use of biosensor-type 
protein chips such as the Bl Acore-type technique (Pharmacia) (Arlinghaus et a/., 
1997, Krone et a/., 1997, Chateiier etaL, 1995). These polypeptides or antibodies 
according to the invention, capable of specifically binding antibodies or polypeptides 
25 derived from the sample to be analyzed, are thus used in protein chips for the 

detection and/or the identification of proteins in samples. The said protein chips may 
in particular be used for infectious diagnosis and preferably contain, per chip, several 
polypeptides and/or antibodies of the invention of different specificity, and/or 
polypeptides and/or antibodies capable of recognizing microorganisms different from 

30 Alloiococcus otitidis. 

Accordingly, the subject of the present invention is also the polypeptides and 
the antibodies according to the invention, characterized in that they are immobilized 
on a support, in particular, on a protein chip. 
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The protein chips, characterized in that they contain at least one polypeptide 
or one antibody according to the invention immobilized on the support of the said 
chip, also form part of the invention. 

The invention comprises, in addition, a protein chip according to the invention, 
5 characterized in that it contains, in addition, at least one polypeptide of a 

microorganism different from Alloiococcus otitidis or at least one antibody directed 
against a compound of a microorganism different from Alloiococcus otitidis, 
immobilized on the support of the chip. 

The invention also relates to a kit or set for the detection and/or the 
10 identification of bacteria belonging to the species Alloiococcus otitidis or to an 
associated microorganism, or for the detection and/or the identification of a 
microorganism characterized in that it comprises a protein chip according to the 
invention. 

The present invention also provides a method for the detection and/or the 
15 identification of bacteria belonging to the species Alloiococcus otitidis or to an 
associated microorganism in a biological sample, characterized in that it uses a 
nucleotide sequence according to the invention. 

The invention also encompasses kits for detecting the presence of an 
Alloiococcus otitidis polypeptide in a biological sample. For example, the kit 
20 comprises reagents such as a labeled or labelabie compound or agent capable of 

detecting Alloiococcus otitidis polypeptide or mRNA in a biological sample; means for 
determining the amount of Alloiococcus otitidis polypeptide in the sample; and means 
for comparing the amount of Alloiococcus otitidis polypeptide in the sample with a 
standard. The compound or agent are packaged in a suitable container. The kit 
25 further comprises instructions for using the kit to detect Alloiococcus otitidis mRNA or 
protein. 

In certain embodiments, detection involves the use of a probe/primer in a 
polymerase chain reaction (PCR) (see, e.g. U.S. 4,683,195 and U.S. 4,683,202), 
such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain reaction 
30 (LCR). This method includes the steps of collecting a sample of cells from a patient, 
isolating nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, 
contacting the nucleic acid sample with one or more primers which specifically 
hybridize to an Alloiococcus otitidis polynucleotide under conditions such that 
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hybridization and amplification of the Afloiococcus otftfd/s-polynucleotide (if present) 
occurs, and detecting the presence or absence of an amplification product, or 
detecting the size of the amplification product and comparing the length to a control 
sample. 

5 

I. Transgenic Animals 

It is contemplated that in some instances the genome of a transgenic animal 
of the present invention will have been altered through the stable introduction of one 

10 or more of the Alloiococcus otitidis polynucleotide compositions described herein, 
either native, synthetically modified or mutated. As described herein, a "transgenic 
animal" refers to any animal, preferably a non-human mammal {e.g. mouse, rat, 
rabbit, squirrel, hamster, rabbits, guinea pigs, pigs, micro-pigs, baboons, squirrel 
monkeys and chimpanzees, etc), bird or an amphibian, in which one or more cells 

15 contain a heterologous nucleic acid sequence introduced by way of human 

intervention, such as by transgenic techniques well known in the art. The nucleic acid 
is introduced into the cell, directly or indirectly, by introduction into a precursor of the 
cell, by way of deliberate genetic manipulation, such as by microinjection or by 
infection with a recombinant virus. The term genetic manipulation does not include 

20 classical crossbreeding, or in vitro fertilization, but rather is directed to the 

introduction of a recombinant DNA molecule. This molecule may be integrated within 
a chromosome, or it may be extrachromosomally replicating DNA. 

The host cells of the invention are also used to produce non-human 
transgenic animals. The non-human transgenic animals are used in screening 

25 assays designed to identify infections or compounds, e.g., drugs, pharmaceuticals, 
etc., which are capable of ameliorating Alloiococcus otitidis symptoms or infections. 
For example, in one embodiment, a host cell of the invention is a fertilized oocyte or 
an embryonic stem cell into which an Alloiococcus otitidis polypeptide-coding 
sequence has been introduced. Such host cells are then used to create non-human 

30 transgenic animals in which exogenous Alloiococcus otitidis gene sequences have 
been introduced into their genome or homologous recombinant animals in which 
endogenous Alloiococcus otitidis gene sequences have been altered. Such animals 
are useful for studying the effects of an Alloiococcus otitidis polypeptide and for 
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identifying and/or evaluating modulators of Alloiococcus otiticlis polypeptide 
infectivity. 

A transgenic animal of the invention is created by introducing an Alloiococcus 
otitidis polypeptide-encqding nucleic acid sequence into the male pronuclei of a 

5 fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing the oocyte to 
develop in a pseudopregnant female foster animal. The human Alloiococcus otitidis 
cDNA sequence of one or more of SEQ ID NO:1 through SEQ ID NO: 4035 can be 
introduced as a transgene into the genome of a non-human animal. 

Moreover, a non-/4//o/ococct/s otitidis homologue of the Alloiococcus otitidis 

10 gene can be isolated based on hybridization to the Alloiococcus otitidis cDNA 

(described above) and used as a transgene. Intronic sequences and polyadenylation 
signals can also be included in the transgene to increase the efficiency of expression 
of the transgene. A tissue-specific regulatory sequence(s) can be operably linked to 
the Alloiococcus otitidis transgene to direct expression of an Alloiococcus otitidis 

15 polypeptide to particular cells. Methods for generating transgenic animals via embryo 
manipulation and microinjection, particularly animals such as mice, have become 
conventional in the art and are described, for example, in U.S. 4,736,866 and 4,870, 
009, U.S. 4,873,191 and in Hogan, 1986. Similar methods are used for production of 
other transgenic animals. A transgenic founder animal can be identified based upon 

20 the presence of the Alloiococcus otitidis transgene in its genome and/or expression 
of Alloiococcus otitidis mRNA in tissues or cells of the animals. A transgenic founder 
animal can then be used to breed additional animals carrying the transgene. 
Moreover, transgenic animals carrying a transgene encoding an Alloiococcus otitidis 
polypeptide can further be bred to other transgenic animals carrying other 

25 transgenes. 

In another embodiment, transgenic non-human animals can be produced 
which contain selected systems that allow for regulated expression of the transgene. 
One example of such a system is the cre/loxP recombinase system of bacteriophage 
PA. For a description of the cre/loxP recombinaste system, see, e.g., Lakso et a/., 
30 1992. Another example of a recombinase system is the FLP recombinase system of 
Saccharomyces cerevisiae (O'Gon-nan et al., 1991). If a cre/loxP recombinase 
system is used to regulate expression of the transgene, animals containing 
transgenes encoding both the Cre recombinase and a selected protein are required. 
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Such animals can be provided through the construction of "double" transgenic 
animals, e.g., by mating two transgenic animals, one containing a transgene 
encoding a selected protein and the other containing a transgene encoding a 
recombinase. 

5 Clones of the non-human transgenic animals described herein can also be 

produced according to the methods described in Wilmut et a/., 1997, and PCT 
International Publication Nos. WO 97/07668 and WO 97/07669. In brief, a cell, e.g., 
a somatic ceil, from the transgenic animal can be isolated and induced to exit the 
growth cycle and enter G 0 phase. The quiescent cell can then be fused, e.g., through 

10 the use of electrical pulses, to an enucleated oocyte from an animal of the same 
species from which the quiescent cell is isolated. The reconstructed oocyte is then 
cultured such that it develops to morula or blastocyst and then transferred to 
pseudopregnant female foster animal. The offspring borne of this female foster 
animal will be a clone of the animal from which the cell, e.g., the somatic cell, is 

15 isolated. 

All patents and publications cited herein are hereby incorporated by 
reference. 

The following examples are carried out using standard techniques, which are 
well known and routine to those of skill in the art, except where otherwise described 
20 in detail. The following examples are presented for illustrative purposes, and should 
not be construed in any way limiting the scope of this invention. 

Example l 

Confirmation of the identity of the Alloiococcus otitidis 1 1 04-9 2 isolate 

25 

The Alloiococcus otitidis isolate 1 104-92 was obtained from Dr. Richard 
Facklam of the Centers for Disease Control in Atlanta. It was isolated from the middle 
ear fluid of a child in the Atlanta, Georgia area. It was confirmed to be A. otitidis by 
comparing it to the type strain, ATCC51267, obtained from the American Type 
30 Culture Collection [Aguirre, 1 992 #1]. Both the 1 1 04-92 and type strain are 

characterized as Gram positive cocci. Both grow on Columbia agar supplemented 
with 5% yeast extract, 0.5% polysorbate 80 (Tween 80), and 0.7% phospatidyl 
choline when incubated at 37°C. On this medium, both strains form slow growing 
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small white colonies that require nearly two days to be easily observed with the 
naked eye. Both are sensitive to lysis by hen egg white lysozyme and Streptococcus 
globisporus mutanolysin. Both grow in the presence of 2% sodium azide. Both are 
killed by incubation at 55°C for 30 minutes. Finally, to further confirm that the 1 1 04- 
5 92 was a strain of A. otitidis, it was subject to polymerase chain reaction (PCR) 
identification based on its 1 6s rRNA gene. This was done using two of the primers 
specified by Aguirre and Collins [Aguirre, 1992 #2]. The antisense primer used was 
5'-ATCTTCCTGCTTGCAGGAAGAGG-3' and the sense primer was 
3'-CGCTTCATCTCTGAAGCTAGC-5\ Thus by multiple criteria, the 1104-92 
10 strain was confirmed to be an isolate of A. otitidis. 

Example 2 

Storage, growth, and harvest of Alloiococcus onnpts 1 1 04-92 for isolation 

OF DNA 

15 

The A. otitidis isolate 1 1 04-92 was stored at -70°C in Todd-Hewlett broth 
containing 40% glycerol. A small portion of the frozen stock was streaked onto the 
agar medium described in Example 1 and incubated at 37°C for two days. The 
growth from the plate was swabbed into a 17 x 100 cm tube containing 6 m! of a 

20 serum-free broth medium. This broth medium was prepared with 30 g Todd-Hewlett 
medium, 5 g yeast extract, 10 ml polysorbate 80 (Tween 80), and 1 liter distilled 
water. This medium was sterilized by autoclaving for 35 minutes. The bacteria were 
incubated aerobically without shaking in an aerobic incubator at 37°C for two days. 
The tube containing the growing bacteria was then shaken to resuspend the bacteria 

25 and added to a liter of the same medium in a Fernbach flask. This flask, in turn, was 
incubated aerobically for three days without shaking. The bacteria were harvested 
by first swirling the flask to suspend the bacteria and then low speed centrifugation at 
about 5,000 x g for 30 minutes. The pellet of bacteria was washed by resuspending 
it in 10 to 1 5 mL of phosphate buffered saline (PBS), and centrifuging the suspension 

30 at about 8,000 x g for 20 minutes. The pellet of bacteria was retained and stored 
frozen at -20°C. The yield of wet bacterial pellet was typically about 1 g per liter of 
broth. 
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Example 3 

Preparation of Alloiococcus otitidis genomic DNA 

To prepare genomic DNA, 0.95 g frozen pellet of bacteria was defrosted and 
5 suspended in 1 0 ml_ of PBS containing 1 mM MgCl 2 . The bacteria were killed by 
incubating the suspension at 55°C for 20 minutes. The suspension was allowed to 
cool before adding 25 jxl of a 10 mg/mL stock of hen egg white lysozyme and 50 jxl of 
a 25,000 unit/mL stock of Streptococcus globisporus mutanolysin to the suspension. 
It was then incubated for one hour at 37°C. Then 50 jjl! of a 1 0 mg/mL stock of 
10 RNase was added and the suspension incubated an additional hour at 37°C. After 
these incubations, sodium dodecylsulfate (SDS) was added to a final concentration 
of 0.3% (0.3 mL of a 10% stock). This was followed by the addition of 0.3 mL of a 1 
mg/mL stock of proteinase K. The suspension was then incubated for two hours at 
37°C. After this time, an equal volume of water saturated phenol/chloroformAisopropyl 
15 (25:24:1 ) was added to the digested suspension and gently mixed. The upper 
aqueous layer was retained after a low speed centrifugation and 2.5 volumes of 
ethanol were added and the tube gently inverted to mix. The DNA was then spooled 
out on a glass rod and allowed to air dry. 

The DNA at this stage still contained obvious impurities and needed further 
20 purification. The DNA dried on the glass rod was soaked in 70% ethanol to remove 
excess phenol and air-dried once again. It was then suspended in 2 ml of Tris-EDTA 
buffer to which 2 [i\ of RNase cocktail was added and incubated at room temperature 
for 75 minutes. Then 100 \x\ of protease, 100 \i\ SDS and 40 \i\ of 100 mM CaCI 2 
were added and the suspension incubated for 3.5 hours. An equal volume of 
25 chloroform was added, gently mixed, then centrifuged at a low speed. The aqueous 
layer was collected and re-extracted with the phenol, chloroform, isopropyl alcohol 
reagent In turn, the aqueous layer was extracted with chloroform. At this point, 3 M 
sodium acetate was added to the aqueous phase collected form the last extraction 
and then 3.75 ml of ethanol was added and gently mixed. The DNA was spooled out, 
30 soaked in 70% ethanol and allowed to air-dry. The DNA was finally suspended in 2 
ml of Tris-EDTA buffer. Based on absorption at 260 nm, the final yield of DNA was 
482 *ig of DNA. The DNA was confirmed to be that of A otitidis by the PCR method 
described in example 1 . This DNA was submitted for sequencing. 
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Example 4 

Cloning And Sequencing Alloiococcus otitidis Genome 

5 This invention provides nucleotide sequences of the genome of Alloiococcus otitidis 
which thus comprises a DNA sequence library of Alloiococcus otitidis genomic DNA. 
The detailed description that follows provides nucleotide sequences of Alloiococcus 
otitidis, and also describes how the sequences were obtained and how ORFs (Open 
Reading Frames) and protein-coding sequences can be identified. 

10 To construct a library, genomic DNA was hydrodynamically sheared in an 

HPLC and then separated on a standard 1% agarose gel. A fraction corresponding 
to 3000-3500 bp in length was excised from the gel and purified by the GeneClean 
procedure (B1O101, Inc.). 

The purified DNA fragments were then blunt-ended using T4 DNA 

15 polymerase. The blunt-ended DNA was then ligated to unique BstX1 -linker adapters. 
These linkers are complimentary to the pGTC vector, while the overhang is not self- 
complimentary. Therefore, the linkers will not concatermertze nor will the cut-vector 
religate itself easily. The liner-adapted inserts were separated from the 
unincorporated linkers on a 1% agarose gel and again purified using GeneClean. 

20 The linker-adapted inserts were then ligated to BstX1-cut vector to construct 
"shotgun" subclone libraries. 

Only major modifications to the protocols are highlighted. Briefly, the library 
was transformed into DH10B competent cells (Gibco/BRL, DH5a transformation 
protocol). Transformed cells were detected by plating onto antibiotic plates 

25 containing ampicillin. The plates were incubated overnight at 37° C. Transformant 
clones were then selected for sequencing. The cultures were grown overnight at 
37°C. DNA was purified using a silica bead DNA preparation (Egelstein, 1996) 
method. In this manner, 25 mg of DNA was obtained per clone. 

These purified DNA samples were then sequenced using ABI dye-terminator 

30 chemistry. All subsequent steps were based on sequencing by automated DNA 
sequencing methods. The ABI dye terminator sequence reads were run on 
MegaBace™ 10000 (Amersham) machines and the data transferred to UNIX based 
computers. Base calls and quality scores were determined using the PHRED 
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software program (Ewing et aL, 1998, Genome Res. 8: 175-185; Ewing and Green, 
1998, Genome Res. 8:685-734). Reads were assembled using PHRAP (P. Green, 
Abstracts of DOE Human Genome Program Contractor-Grantee Workshop V, Jan. 
1996, p 157) with default program parameters and quality scores. 

5 To identify Alloiococcus otitidis genome encoded polypeptides, the complete 

genomic sequence of Alloiococcus otitidis was analyzed essentially as follows: First, 
all possible stop-to-stop open reading frames (ORFs) > 222 nucleotides in all three 
reading frames were translated into amino acid sequences. 

Second, the identified ORFs were analyzed for homology to known protein 

10 sequences. Third, the coding potential of non-homologous sequences were 

evaluated with the GENEMARKTM software program (Borodovsky and Mclninch, 
1993, Comp. Chem. 17:123). The results of these analysis are set forth in tables 2- 
16. 

15 Example 5 

Identification of specific genes in Alloiococcus otitidis 

Alloiococcus otitidis homologs of the genes listed in Table 4 were identified as 

follows: 

20 Protein sequences of interest ("query sequences", Table 4) were extracted 

from Genbank from one or more species; query species included but were not limited 
to Staphylococcus aureus, Streptococcus pnuemoniae, Streptococcus pyogenes, 
Lactococcus lactis, Escherichia coli, and Bacillus subtilis. These queries were 
compared to the Alloiococcus otitidis sequence by several methods in order to 

25 determine which Alloiococcus sequence was the ortholog for the query gene. 

First, the query sequences were compared to the translated Alloiococcus 
otitidis ORF set using BLASTP. The ORF set was generated as described in 
Vaccines patent, except that for each ORF that had multiple potential start codons, 
only the longest ORF was used. The top 1 0 Alloiococcus otitidis hits for each query 

30 were saved, without regard to score. 

These Alloiococcus otitidis hits were then compared to NR, the nonredundant 
Genpept database, using BLASTP. An Alloiococcus otitidis ORF was considered the 
ortholog of a query sequence if the genes were reciprocal best hits in Alloiococcus 
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otitidis and the query genome. This analysis is also sumarized in Table 4 (excel file 
AOT_PATENT_FILE.xls, Sheet TopHitsAndClustalKey). Specific numerical cutoffs 
were not used; however all top hits had Expect values of less than 3x1 (T 28 . 

Several query sequences had more than one high-scoring hit in Alloiococcus 

5 otitidis. In most cases, however, only the first, best hit to the original query sequence 
had that query sequence as its reciprocal best hit. For example, the Streptococcus 
pyogenes query sequence GyrA (alpha subunit of DNA gyrase) has two high-scoring 
hits in Alloiococcus otitidis. These were distinguished by the reciprocal blast 
analysis; the first, ORF_505 (60% identity, Expect = 0) is the GyrA homolog and the 

10 second, ORF_1907 (38% identity, Expect = 1 x 10 ~ 154 ) is the homolog of the query 
sequence GrIA or ParC (topoisomerase IV, A subunit). Other examples of closely 
related proteins include the B subunits of DNA gyrase (GyrB) and Topoisomerase IV 
(GrIB or ParE); and YphC and Era, both of which are putative GTP binding proteins 
of unknown function. These Alloiococcus otitidis ORFS were assigned based on 

15 their top hit in Genpept. 

» 

In two cases the multiple high-scoring hits in Alloiococcus otitidis were the 
result of gene duplication. In the case of MurA (UDP-N-acetylglucosamine 
enolpyruvyl transferase) two separate Alloiococcus otitidis ORFS were determined to 
be the desired orthologs, because both had MurA (or MurZ, alternate notation) as 

20 their best hit in Genpept. Likewise, there are two FolC (folylpolyglutamate synthase) 
homologs in Alloiococcus otitidis. It is known that other bacteria, particularly Gram- 
positive bacteria, may carry two homologs of each of these genes. 

As a further step in verification of gene assignments, the Alloiococcus otitidis 
ORFS identified as orthologs of the query genes by the analysis above were then 

25 compared to an internal copy of the COGS database (Tatusov RL, Natale DA, 

Garkavtsev IV, Tatusova TA, Shankavaram UT, Rao BS, Kiryutin B, Galperin MY, 
Fedorova ND, Koonin EV, 2001 , Nucleic Acids Res 2001 Jan 1 ;29(1 ):22-8. The 
COG database: new developments in phylogenetic classification of proteins from 
complete genomes) using BLASTP. The COGS database is a curated set of proteins 

30 from a set of finished bacterial genomes, which have been grouped into specific 
protein families on the basis of protein similarity. In all cases, the Alloiococcus 
otitidis ORF was most closely related to the COGS family of the initial query protein, 
if that protein had been assigned to a COGS family. Examples of proteins for which 
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there is no COGS family defined (in our local version of the database) include SrtA 
(sortase) and MvaK1 (phosphomevalonate kinase). 

As a final confirmation, all query proteins were compared to the complete 
Alloiococcus otitidis nucleotide sequence using TBLASTN, in order to determine if 
5 there were additional and/or better hits that had not been predicted as ORFS. In all 
cases, the same sequence was identified as the best hit by TBLASTN and by 
BLASTP. 

For one query sequence, sortase, the Alloiococcus otitidis ORF that was the 
top hit (Expect = 0.42) by the initial BLASTP or TBLASTN using the Staphylococcus 

10 aureus sortase sequence as query was found by additional analysis (reciprocal blast) 
to be a putative ABC-transport protein. The true sortase homolog in Alloiococcus 
otitidis was identified by construction of a Hidden Markov Model based on a multiple 
alignment of 72 known and putative sortase proteins that had been identified 
previously using similar computational methods. The model was constructed using 

15 "hmmbuild" and the Alloiococcus otitidis ORF set was searched using "hmmsearch", 
both of the hmmer package (S.R. Eddy. Profile hidden Markov models. 
Bioinformatics 14:755-763, 1998). The assignment of ORF_876 as sortase was then 
confirmed by reciprocal blast as described above and in Table 2. ORF„876 was also 
found to be the top hit in Alloiococcus otitidis when the Bacillus subtilis putative 

20 sortase (YhcS) was used as the query sequence in a BLASTP search. The Bacillus 
halodurans BH3596 Bacillus subtilis YhcS and proteins that are the top hits for 
RF_876 have recently been placed into a COGS group of sortases, further 
confirming the identity of ORF_876 as the Alloiococcus otitidis sortase. 
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Example 6 

Identification of the gene encoding Coenzyme A (CPA) in Alloiococcus 

OT777Q/S 

10 

Pantothenate kinase (PanK, CoaA) encoded by the coaA gene catalyzes the 
initial step in Coenzyme A (CoA) biosynthesis. CoA is an essential co-factor in a 
number of metabolic pathways in bacteria and mammals. Short-chain thioesters 
15 such as acetyl-CoA and succinyi-CoA are essential intermediates in carbon 

metabolism. CoA-thioesters of long chain fatty acids feed into p-oxidation and are 
also the source of fatty acids for phospholipids. In addition, CoA and its thioesters 
play important roles in the regulation of several enzymes in intermediary metabolism, 
including pyruvate dehydrogenase and phosphoenol pyruvate carboxylase. Finally, 



-77- 



WO 03/104391 



PCT/US02/36122 



synthesis of holo acyl carrier protein (ACP) is dependent on CoA for the 4'- 
phosphopantetheine moiety linked to ACP. ACP is essential for fatty acid 
biosynthesis. The two major acyl-carrier groups in cells: CoA and ACP, are derived 
from pantothenate. Pantothenate can be obtained exogenously through uptake via a 

5 permease, the product of the panF gene. Alternately, pantothenate is the product of 
condensation of pantoate and 0-alanine via pantothenate synthetase, the product of 
the panC gene. The initial step in CoA biosynthesis is the phosphorylation of 
pantothenate by pantothenate kinase (PanK, CoaA). 

The coaA gene was originally identified by Dunn and Snell in S. typhimurium 

10 as a temperature sensitive allele. Similarly, a temperature sensitive allele of coaA 
was reported for E. coli in 1 987. CoaA was found to be essential in E coli in a 
recent genetic footprinting analysis. In the temperature sensitive strains, 
accumulation of phosphorylated CoA intermediates rapidly ceased following shift to 
the non-permissive temperature. CoaA was shown to be a homo-dimer of 35 kDa 

15 subunits that bound ATP cooperatively. ATP is bound first in a sequential 

mechanism of action; CoA has been shown to be a potent inhibitor of the reaction 
and competitively competes for binding with ATP. Therefore CoaA is under feedback 
regulation and is the major regulatory step in CoA biosynthesis. 

. Lysine 101 in bacterial pantothenate kinase (CoaA) was found to be essential 

20 for both ATP and CoA binding. This supports kinetic data that CoA is a competitive 
inhibitor of ATP binding to CoaA and that both substrates bind to the same site. 

Homologues of E coli CoaA have been identified in B. subtilis, S. pyogenes, 
M. tuberculosis, H. influenzae and V. cholerae. Homologues have not been identified 
in either the S. cerevisiae genome or in a mammalian expressed sequence tag 

25 database. Calder et a/, identified a homologue, through functional complementation 
of an E. coli coaA ts mutant, in A. nidulans. Homologue of this gene identified in 
Ailoiococcus otitidis as described in Example 5 (Seq. ID No 47. The protein encoded 
by the gene is set forth in Seq. ID No. 48. 

The A. nidulans gene was then used to identify a yeast homologue. The 

30 bacterial and Aspergillus enzymes were found to be 16% identical and 32% similar. 
Although this level of similarity is quite weak the essential lysine residue involved in 
nucleotide binding appears to be conserved; however, the sequence surrounding the 
lysine residue were not conserved and further study will be required to validate this 
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finding. The most striking difference between the eukaryotic and prokaryotic 
enzymes is found in the sensitivity of each to competitive inhibition by CoA and 
acetyl-CoA. The yeast enzyme was most sensitive to acetyl-CoA and less sensitive 
to CoA, whereas the converse was true for the bacterial enzyme. Later studies 
5 demonstrated that mammalian pantothenate kinase is activated by CoA and inhibited 
by acetyl-CoA. 

* 

Nucleotide binding 

Binding of ATP to CoaA is directly demonstrated by equilibrium dialysis 
10 employing the non-hydrolyzabie ATP analogue ATPyS. The Kd measured for ATP 
binding is reported to be 2.1 pM. 

CoA binding 

Binding of CoA to CoaA is directyl demonstrated by equilibrium dialysis and 
15 the Kd is reported to be 6.7 pM. 

Pantothenate kinase activity 

Specific kinase activity of CoaA is demonstrated using D-[1 - 14 C]pantothenate and 
capturing 4'-phospho[1- 14 C]pantothenate on DE81 filters. Using this assay the 
20 following kinetic values were derived: specific activity - 470+/- 200 nmol/min/mg; 
pantothenate Km - 36 pM; K™ ATP - 136 pM. 

Suitability of target for anti-infective development 

Coenzyme A biosynthesis is essential for bacterial viability. CoaA catalyzes the 
25 first step of biosynthesis of CoA and appears to be the point of regulation for the 
pathway. The essentiality of CoaA is demonstrated through the construction of 
temperature sensitive alleles in coaA Although the yeast enzyme is found to- 
functionally complement the bacterial temperature sensitive allele, sequence and 
kinetic differences suggest the possibility of identifying inhibitors of the bacterial 
30 enzyme with high selectivity. As CoaA is essential and conserved in gram-negative 
and gram-positive pathogens, such inhibitors will have broad-spectrum utility. 
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Suitable assays for measuring CoaA function 

CoaA is purified by standard methods using widely available molecular tags 
following expression at high level from E. coli. Pantothenate kinase activity is 
.measured as follows: CoaA and D-[1- 14 C]pant6thenate is incubated in a buffer 
5 consisting of 100 mM Tris (pH 7.4), 2.5 mM MgCI 2 , 2.5 mM ATP for 5-60 minutes at 
37'C. Product, 4'-phospho[1 - 14 C] pantothenate, is monitored through retention of 
labeled material on DE81 filters. This assay is amenable to high-throughput 
screening using high-density well-filter plates. 

io Example 7 

Identification of the gene encoding CoaBC (Dfp) in Alloiococcus otitidis 

The E. coli dfp gene, which encodes the previously designated Dfp protein, 
was originally identified as encoding an enzyme required for CoA biosynthesis. The 

15 gene, coding for the protein of interest, was renamed coaBC to reflect the enzyme 
function in CoA biosynthesis. CoA is an essential co-factor in a number of metabolic 
pathways in bacteria and mammals. Short-chain thioesters such as acetyi-CoA and 
succinyl-CoA are essential intermediates in carbon metabolism. 

CoaBC carries out the second and third steps of coenzyme A 

20 biosynthesis: the conjugation of 4'-phosphopantetheate with cysteine by the CoaB 
(PPCS : 4'phosphopantethenoyl cysteine synthase) activity followed by the 
conversion to 4-phosphopantetheine by the CoaC (PPCDC: 
4'phosphopantenoylcysteine decarboxylase) activity. Homologue of this gene 
identified in Alloiococcus otitidis as described in Example 5 (Seq. ID No 77). The 

25 protein encoded by the gene is set forth in Seq. ID No. 78. 

Enzyme activity of CoaBC (Dfp): 

Initially it was demonstrated that Dfp enzyme catalyzing oxidative 
30 decarboxylation of (R)-4'-phospho-N-pantothenoylcysteine (PPC) to form 4'- 

phosphopantetheine (PP) - the third step in CoA biosynthesis from pantothenate 
The Km for this reaction is 800 jxM for PPC. 

Subsequently, it was established that Dfp is a bifunctional enzyme, catalyzing 
the second step of CoA biosynthesis, coupling of 4'-phosphopantothenate with 
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cysteine to form PPC, as well. This reaction is a two-step process and requires CTP 
for initial 4'-phosphopantothenate activation. Second step couples cysteine to the 
phosphopantothenate moiety with a release of CMP. Estimated Km s are 300 m-M for 
4'-phosphopantothenate and CTP, and 250 jiM for cysteine. 

5 

CoaBC as target for antibacterial development. 

Coenzyme A (CoA) plays a vital role in the metabolism of living cells. 
According to a recent report, 4% of all enzymes in the cell require CoA, its thioesters 

10 or 4'-phosphopantetheine. Recent genetic footprinting experiments on E. coliand 
direct gene knockout have established that this coaBC is essential for bacterial 
growth. Homologs of coaBC have been identified in a number of gram-positive and 
gram-negative organisms, which suggested the possibility of developing a broad- 
spectrum antibacterials from coaBC inhibitors. Considering the bif unctional nature of 

15 CoaBC, it is feasible to identify inhibitors that will inhibit both enzymic functions, thus 
arresting two steps in the CoA pathway. Another important factor in favor of 
selecting CoaBC as a target for antibacterials is low homology of the bacterial 
enzyme to eukaryotic counterparts. In most of the higher organisms including 
humans, two separate enzymes carry out these functions. Moreover, mammalian 

20 (R)-4'-phospho-N-pantothenoylcysteine decarboxylase is a pyruvate-dependent 
enzyme, while CoaBC requires flavine mononucleotide for its function. 

Assays for measuring CoaBC activity. 

25 PPC synthetase activity is be monitored by detecting the released 

pyrophosphate. This is achieved by converting pyrophosphate to inorganic phospate 
with pyrophosphatase and detection by the Malachite Green assay, or by the MESG 
assay spectrophotometrically. CoaBC (2 \ig) is incubated in the reaction buffer 
containing 10 mM DTT, 2 mM MgCI 2 , 50 mM Tris-HCI, pH 8, 300 p.M 4- 

30 phosphopantothenate, 3.5 mM CTP, 5 \ig pyrophosphatase. The reaction is started 
by addition of appropriate amount (1 0-500 piM final) of cysteine. The reaction is 
stopped at different time points by addition of equal volume of 5M H 2 S0 4 . The 
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10 



amount of inorganic phosphate released will be determined according to the one of 
described techniques. 

PPC synthetase activity is also monitored by detecting the release of carbon 
dioxide from 14 C-labeled cysteine. CoaBC (2 \ig) is incubated in the reaction buffer 
containing 10 mM DTT, 2 mM MgCI 2 , 50 mM Tris-HCI, pH 8, 2.5 yM 4'- 
phosphopantothenate, 3.5 mM CTP. The reaction is started by addition of 
appropriate amount (30 mM, final concentration) of 14 C-labeled cysteine. The 
reaction is stopped at different time points by addition of equal volume of 5M H 2 S0 4 . 
Amount of released 14 C-labeled C0 2 is determined according to published technique. 

Example 8 

Identification of the gene encoding phosphopantetheine adenvlvltransferase 
15 (CoaD) in Alloiococcus otitidis 

Phosphopantetheine adenylyltransferase, (PPAT, CoaD, KdtB) catalyzes the 
penultimate step in Coenzyme A (CoA) biosynthesis. The fourth step in CoA 

20 biosynthesis is the addition of AMP to 4'-phosphopantetheine by phosphopantetheine 
adenylyltransferase (CoaD) to form 3' dephospho-CoA (dPCoA). 

The coaD gene was first identified in E. col! by Geerlof et al. CoaD is 
essential for viability in E. coli and S. aureus. The enzyme has a mass of 1 8 kDa and 
was determined to be a hexamer through cross-linking studies. Crystallography 

25 confirmed the oligomeric state of the enzyme. Moreover, co-crystallography of CoaD 
with dPCoA has also been carried out mapping the binding pocket for the major 
product of the reaction. Interestingly, in mammals PPAT has been shown to be in a 
complex with dephospho Coenzyme A kinase (dPCoA kinase, DPCK). This enzyme, 
purified from pig liver, is referred to as CoA Synthase. The yeast PPAT is associated 

30 with a protein complex that is in excess of 375 kDa and composed of six proteins. 
There is no detectable homology between the bacterial PPAT (CoaD) and the 
recently identified human PPAT, the activity of which is contained in a bif unctional 
PPAT/DPCK enzyme. Homologues of £ coli CoaD have been identified in P. 
aeruginosa, S. pneumoniae, S. aureus, H. influenzae, H. pylori, B. anthracis and M. 

35 tuberculosis, Homologue of this gene identified in Alloiococcus otitidis as described in 
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Example 5 (Seq. ID No 81). The protein encoded by the gene is set forth in Seq. ID 
No. 82. 

Enzyme activity 

5 CoaD (PPAT) carries out the reversible transfer of AMP to 4'- 

phosphopantetheine, forming dephosphocoenzyme A and releasing PPi. The 
reverS e reaction was demonstrated by Geerlof et a/, using a coupled assay to tie 
ATP production to NADP reduction, which is monitored at 340 nm. The following 
kinetic constants were calculated: kcat = 3.3 +/- 0.1 /sec; K m(dPC oA) = 7.0 +/- 1 .4 uM; 

10 Kmtppi) = 0.22 +/- 0.04 mM. 

CoaD as target for anti-infective development. 

Coenzyme A biosynthesis is essential for bacterial viability. CoaD, 
15 phosphopantetheine adenylyltransferase, catalyzes the fourth step in the pathway 
and was shown to be essential in both E. coli and S. aureus. There is no measurable 
homology between CoaD and the human PPAT enzyme, so the liability of poorly 
selective compounds is quite low. As CoaD is essential and conserved in gram- 
negative and gram-positive pathogens, inhibitors developed against this target will 
20 have broad-spectrum utility. 

Assays for measuring CoaD function 

CoaD will be expressed and purified using standard methodologies for 
bacterial expression and affinity tag-based, purification. Two assay formats can be 

25 used to monitor enzymatic activity: the forward reaction and the reverse reaction. 

The forward reaction assay was initially described for measuring the activity 
of the human PPAT activity in the PPAT/DPCK enzyme. The enzyme assay is 
carried out in 50 mM Tris (pH 8.0), 2 mM MgCI 2 , 5 mM ATP, 5-500 uM 4'- 
phosphopantotheine, 7.5 mM NADH and enzyme (initially 0.1 - 1.0 pg/ml). The 

30 production of PPi is detected using the protocol of O'Brien in which PPi production is 
coupled to the oxidation of NADH to NAD. This system requires the addition of 4 
enzymes (PPj-dependent phosphofructokinase, aldolase, triosephophate isomerase 
and glycerol-3-P dehydrogenase) to the basic reaction mix and presents the added 
issue of deconvolution, which limits the use of the assay as a primary screen. 
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The reverse direction assay is carried out also as a coupled assay to tie ATP 
production to NADP reduction following the method described by Larnprecht & 
Trautschold. The assay is set up in reaction buffer containing the following: 50 mM 
Tris (pH 8.0), 1 mM DTT, 2 mM MgCI 2 , 1 mM NADP, 5 mM glucose, 2 mM PP ti 0.1 
5 mM dPCoA. Hexokinase (4 units) and glucose-6-phosphate dehydrogenase (1 unit) 
will be added to the assay as the coupling enzymes in addition to CoaD (initially 0.1 - 
1 pg/ml). The assay is monitored at 340 nm. Deconvolution of hits is required with 
this assay, however with only 2 additional enzymes the task will be less cumbersome 
when compared to the forward assay described above. 

10 

Example 9 

Identification of the gene encoding pephosphqCoA kinase (DPCK, YacE, 

CoaE) in Alloiococcus otitidis 

15 

DephosphoCoA kinase (DPCK, YacE, CoaE) encoded by the coaE gene 
catalyzes the final step in Coenzyme A (CoA) biosynthesis. The final step in CoA 
biosynthesis is the phosphorylation of the 3'-hydroxyI group of dephospho-CoA to 

20 form CoA by dephosphocoenzyme A kinase (DPCK, YacE, CoaE). 

The determination that the previously identified yacE gene encoded the 
dephosphocoenzyme A kinase activity was reported by Mishra et al. These authors 
previously determined that separate enzymes encode the phosphopantetheine 
adenyltransf erase (PPAT) and dephosphocoenzyme A kinase (DPCK) activity in 

25 Corynebacterium ammoniagenes in contrast to the eukaryotic enzymes in which the 
PPAT and DPCK activities are coupled. The E. coli gene, encoding a 25 kDa 
protein, was cloned based on the sequence of the C. ammoniagenes gene and found 
to be identical to the previously described yacE gene. The gene was designated 
coaE to follow existing nomenclature in E. coli. CoaE (YacE) was shown to be 

30 essential in E. coli through genetic footprinting. CoaE is widely distributed in 

bacteria. Homologue of this gene identified in Alloiococcus otitidis as described in 
Example 5 (Seq. ID No 93). The protein encoded by the gene is set forth in Seq. ID 
No. 94. 
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Assays for measuring CoaE function 

CoaE carries out the phosphorylation of dephosphocoenzyme A at the 3' 
hydroxy! group, consuming ATP, to form CoA. Dephosphocoenzyme A kinase 
activity is measured in a coupled reaction in which NADH oxidation to NAD is tied to 

5 ADP production. In this assay, the standard pyruvate kinase/lactose dehydrogenase 
coupling system is used to generate NAD in a 1 :1 molar equivalent to the ADP 
produced by the test enzyme. NADH oxidation to NAD is monitored at 340 nm in a 
standard spectrophotometer. The following kinetic values were determined for CoaE: 
Km (atp) = 0.74 mM; Km ( d ep hospho-coA) = 0.14 mM (7). 

10 The formation of CoA is monitored using a coupled enzyme system in which 

acetyl-CoA is formed in proportion to the amount of CoA in the assay. Three 
enzymes (phosphate acetyl transferase, citrate synthase and malate dehydrogenase) 
are added to the reaction that results in the formation of NADH from NAD, which is 
monitored at 340 nm. 

15 

CoaE as a target for anti-infective development 

Coenzyme A biosynthesis is essential for bacterial viability. CoaE, 
dephosphocoenzyme A kinase, catalyzes the final step in CoA synthesis and is . 
shown to be essential by genetic footprinting in E. coli. A degree of homology 

20 between CoaE and the human DPCK enzyme has been noted, such that selectivity 
assays is necessary to determine a high therapeutic index for CoaE inhibitory 
compounds. CoaE is conserved in gram-negative and gram-positive pathogens and 
should have broad-spectrum utility in the clinic. 

CoaE is expressed and purified using standard methodologies for bacterial 

25 expression and affinity tag-based purification. DephosphocoA kinase activity is 
monitored using a coupled enzyme system to tie ADP production to oxidation of 
NADH to NAD. The decay of absorbance at 340 nm will be the assay readout. The 
assay will be setup in the following buffer: 50 mM Tris (pH 8.5), 20 mM KCI, 10 mM 
MgCI 2 , 10 mM ATP, 0.3 mM NADH and 0.4 mM phosphoenoipyruvate. The coupling 

30 enzymes: pyruvate kinase (10 U) and lactate dehydrogenase (4 U) will be added 

along with dephosphocoenzyme A kinase (initially 0.1- 1 .0 ug/ml). The assay will be 
started by the addition of 0.4 mM dephosphocoenzyme A. In this assay system, the 
release of ADP is tied to the oxidation of NADH to NAD, and is monitored at 340 nm. 
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This assay is transferable to a high-density microtiter plate format and suitable for 
HTS. 

Example 10 

5 Identification of dnaB and pcrA, genes encoding helicases in Alloiococcus 

otitidis 

Helicases unwind double-stranded DNA in a reaction that couple nucleotide 
binding and hydrolysis to strand unwinding. Their activity is required for a number of 
10 biological processes such as separation of the chromosome during replication, 

recombination and repair. Homologue of thse genes were identified in Alloiococcus 
otitidis as described in Example 5 (Seq. ID No 15 and 99). The protein encoded by 
the gene is set forth in Seq. ID No. 16 and 100. 

15 Due to the essentia! roles modulated by these molecules they represent an 

important target for antibacterial therapy. Homologs of dnaB and pcrA genes 
encoding helicases were identified as described in Example 5. A primary assay, 
which detects helicase function in vitro, is used to identify inhibitors of each enzyme 
and is described below. 

20 Genes encoding DnaB and PcrA is obtained using polymerase chain reaction 

amplification of the genomic region encoding them. The genes is subcloned into a 
standard expression vector either containing an amino acid tag for ease of 
purification or not. The enzyme is then over-expressed in Escherichia coliand 
purified using a standard tag system. 

25 Most helicases require a region of single-stranded DNA flanking the duplex 

region that it unwinds. As a result, providing a single stranded region to either the 3' 
or 5* end of a duplex allows for determination of the polarity of helicase unwinding. 
These types of experiments have demonstrated that PcrA and DnaB are 3'-5' and 5'- 
3' helicases, respectively. None the less, a convenient filtration assay has previously 

30 been described that is formatted for high-through-put screening of inhibitors of either 
enzyme, regardless of polarity. Assays (90 ul) contained 15 pM single-stranded M13 
DNA to which a radiolabeled oligonucleotide had been annealed as a substrate for 
unwinding. Reactions are carried out in 96-weil GF/C untfilter hydrophobic plates 
(Polyfiltronics Inc.) in 70 ul helicase buffer [20 mM Hepes (pH 7.6), 4 mM MgCI 2 4 
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mM ATP, 100 ug/ml BSA, 5% glycerol and 2 mM DTT] and 10 ul of DMSO or 
compound. Reactions are initiated by adding 10 ui of purified helicase protein and 
are incubated for 1 hr at room temperature. 1 00 u! of 2X capture buffer containing 
silica beads [25% methanol, 3 M Nal, 0.03% NP-40, and 10% GlassFog beads 
5 (BIO101)] were added. The mixture was incubated for 30 min at room temperature. 
Plates are then washed 5X on a Bio-Teck instruments, Auto Washer EL403) with 
wash buffer (50% ethanol, 0.2% NP-40 and 50 mM NaCI). Scintillation fluid was 
added and plates are counted (Packard Topcount). 

iq Example 11 

Identification of dnaE. the gene encoding DnaE-polymerase in Alloiococcus 

OTlTIDtS 

DnaE is an enzyme that catalyzes the DNA template directed polymerization 
15 of deoxyribonucleotides into deoxyribonucleic acid. The enzyme has been reported 
to modulate lagging strand synthesis at gram-positive replication forks. Functions for 
DnaE have been defined biochemically, in Bacillus subtilis and Streptococcus 
pyogenes. Homologue of this gene identified in Alloiococcus otitidis as described in 
Example 5 (Seq. ID No 75). The protein encoded by the gene is set forth in Seq. ID 
20 No. 76. - 

Because DnaE is an essential protein in gram-positive bacteria and has high 
homology to the gram-negative dnaE, which is an essential polymerase subunit of 
the DNA polymerase III holoenzyme, it serves as a good target for antibacterial drug 
discovery. A primary assay, which detects processive DnaE mediated DNA 
25 synthesis in vitro, is useful identify inhibitors of the enzyme and is described below. 

The gene encoding DnaE I in Alloiococcus otitidis was identified as described 
in Example 5. Purification of DnaE DNA polymerase from Alloiococcus, The gene 
encoding DnaE is obtained using polymerase chain reaction amplification of the 
dnaE gene. The gene is subcloned into a standard expression vector either 
30 containing an amino acid tag for ease of purification or not. The enzyme is then 
over-expressed in Escherichia coliand purified using a standard tag system. 

Because DnaE catalyzes the incorporation of single deoxyribonucleotides into 
DNA, the incorporation of radiolabeled deoxyribonucleotides into larger 
deoxyribonucleic acid molecules is monitored to measure activity of the enzyme. A 
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filtration assay has been previously described for Streptococcus pyogenes DnaE that 
uses filterplates containing DE81 filters to capture polymerized DNA. This assay is 
amenable to high-through-put screening format for DnaE. Assays contained 70 ng of 
30-mer primed M13mp1 8 single stranded DNA as a template for replication. The 
reaction contained 3.3-300 ng of DnaE in 23.5 pi of replication buffer [20 mM Tris- 
HCL (pH 7.5), 4% glycerol, 0.1 mM EDTA, 5 mM DTT, 2 mM ATP, 8 mM MgCI 2 , 40 
pg/ml BSA] and 60 pM of both dGTP and dCTP. NaCI was added to the reaction 
mixture to a final concentration of 40 mM. DNA synthesis was initiated by the 
addition of 1 .5 pi of 1 .5 mM dATP and 0.5 mM [p- 32 P]dTTP. Reactions were 
incubated at 37°C for various lengths of time and were quenched by adding an equal 
volume of 1% SDS and 40 mM EDTA. One-half of the terminated reaction was 
applied to DE81 filter paper and washed 3X with wash solution (0.3 M Ammonium 
formate and 0.01 M Sodium pyrophosphate). Filters were then placed in scintillation 
vials and 1 ml scintillation counting liquid was added. Radioactivity was counted 
using a scintillation counter. 

Example 12 

Identification of dnaG. the gene encoding primase in Alloiococcus otitidis 

DnaG is an enzyme that catalyzes the DNA template directed polymerization 
of ribonucleotides into ribonucleic acid de novo . Ribonucleic acid molecules that are 
synthesized by DnaG primase subsequently serve as primers for synthesis of the 
leading- and lagging-strands during chromosomal replication. Functions for DnaG 
have been defined biochemically, and the crystal structure of the RNA polymerase 
domain has been determined in Escherichia coli. Homologue of this gene identified in 
Alloiococcus otitidis as described in Example 5 (Seq. ID No 63). The protein encoded 
by the gene is set forth in Seq. ID No. 64. 

Because DnaG primase plays an essential role in both leading- and lagging- 
strand synthesis during chromosomal replication, and DnaG has homologs in all 
prokaryotes but not eukaryotes, it serves as a good target for antibacterial drug 
discovery. A primary assay, which detects DnaG mediated RNA synthesis in vitro, 
can be used to identify inhibitors of the enzyme and is described below. 
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Assay for the activity of DNA polymerase and identification of compounds that 
inhibit DnaG 

The gene encoding DnaG is obtained using polymerase chain reaction 
5 amplification of the dnaG gene. The gene is subcloned into a standard expression 
vector either containing an amino acid tag for ease of purification or not. The 
enzyme is then over-expressed in Escherichia coii and purified using a standard tag 
system. 

Because DnaG catalyzes the incorporation of single ribonucleotides into RNA, 
10 the incorporation of radiolabeled ribonucleotides into larger ribonucleic acid 
molecules is monitored to measure activity of the enzyme. A high-throughput 
scintillation proximity assay (SPA) assay, previously described for E. co//DnaG, is 
used to meadure activity of DnaG activity in a coupled reaction with DnaB helicase. 
The assay, which was shown to work with DnaG alone, is used to screen for 
15 compounds that inhibit DnaG function. Assays are run in 96-weli Packard Optiplate 
plates. First, 1 pi DMSO or test compound was added, followed by 20 pi of DnaG 
(208 nM) and 3.3 nM M13mp18 single-stranded DNA. Reactions are initiated by 
adding 10 ul of primase assay buffer [50 mM Tris-HCl (pH 7.5), 4% sucrose, 8 mM 
DTT, 5 mM MgCI 2 , 40 ug/ml BSA, 0.1 pg/ul Rifampicin, 25 U/ml RNA guard, 100 pM 
20 GTP, 100 pM UTP, 3 pM CTP, 1 mM ATP] and 0.4 pCi [ 3 H]CTP. Reactions are 
incubated at 30°C for 30 min. Next, a suspension of 50 pi of 2.5 mg/ml PVT-PEI 
SPA beads (Amersham; prepared in 0.3 M NaCitrate, pH 3.0) were added. Plates 
were read after 1 hr on a Topcount instrument (Packard). 

25 Example 13 

DnaN, DnaX, HolA, HqlB, and PolC. the genes encoding the suBUNrrs OF 

ALLOIOCOCCUS OTITIDiS DNA POLYMERASE III HOLOENZYME: BETA (B), TAU (T), DELTA 

(A), DELTA' (A') AND POLC. 

30 

DNA polymerase 111 holoenzyme is an enzyme complex comprised of multiple 
highly conserved subunits that catalyzes the DNA template directed polymerization of 
deoxyribonucleotides into deoxyribonucleic acid. In gram positive organisms the 
holoenzyme is composed of a polymerase subunit, PolC, and accessory proteins. 
35 The accessory proteins act in a coordinated manner to clamp the polymerase tightly 
to the DNA template allowing the polymerase to synthesize DNA with high speed and 
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processivity. Homologue of these genes identified in Alloiococcus otitidis are 
described in Example 5 (Seq. ID Nos. 21 , 105, 79, 103, and 105 respectively). The 
protein encoded by the gene is set forth in Seq. ID No. 22, 106, 80, 104 and 106 
respectively). 

5 Functions for the individual subunits have been defined biochemically and 

interactions between them have now been deduced structurally by crystallographic 
analysis of the enzyme from Escherichia coli. Tau interacts directly with both delta 
and delta' to form a clamp loader complex. Upon binding ATP the complex 
undergoes a conformational change altering an interaction between delta and delta', 

10 which allows delta to subsequently interact with the beta-clamp. The beta-clamp is a 
ring-shaped.homomultimer assembly that can be opened by delta and placed onto a 
primed DNA template. ATP hydrolysis results in closing the clamp around DNA and 
dissociation of the clamp-loading complex. PolC then couples with the beta clamp to 
form a highly processive polymerase. 

15 Because DNA polymerase 111 holoenzyme is comprised of multiple subunits, 

the opportunity exists to inhibit its activity at a number of different sites. A primary 
assay, which detects processive DNA synthesis in vitro, can be used to identify 
inhibitors of the enzyme and is described below. Deconvolution of inhibitors, based 
on either activity of physical interaction, follow the primary assay. 

20 

Assay for the activity of DNA polymerase 

Purification of DNA polymerase III holoenzyme subunits from Alloiococcus. 
Genes encoding the subunits of DNA polymerase is obtained using polymerase 
chain reaction (PCR) amplification of the genomic region encoding them. The genes 
25 are subcloned into a standard expression vector either containing an amino acid tag 
for ease of purification or not. The enzyme is then over-expressed in Escherichia coli 
and purified using a standard tag system. 

Because DNA polymerase III catalyzes the incorporation of single 
deoxyribonucleotides into DNA, the incorporation of radiolabeled deoxynucleotides 
30 into larger deoxyribonucleic acid molecules is monitored to measure activity of the 
enzyme. A filtration assay is previously described for Streptococcus pyogenes DNA 
polymerase III that uses filterplates containing DE81 filters to capture polymerized 
DNA (2). This assay is amenable to high-through-put screening format. Assays 
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contained 70 ng of 30-mer primed M13mp18 single stranded DNA as a template for 
replication. The reaction contained 43 ng of R and 140 ng of PoIC-taa' complex in 
23.5 Ml of replication buffer (20 mM Tris-HCL (pH 7.5), 4% glycerol, 0.1 mM EDTA, 5 
mM DTT, 2 mM ATP, 8 mM MgCI 2 , 40 ug/ml BSA, and 60 14M of both dGTP and 

5 dCTP. DNA synthesis was initiated by the addition of 1 .5 ul of dATP and [ul- 

32 P]dTTP. Reactions were incubated at 37°C for various lengths of time and were 
quenched by adding an equal volume of 1% SDS and 40 mM EDTA. One-half of the 
terminated reaction was applied to DE81 filter paper and washed 3X with wash 
solution (0.3 M Ammonium formate and 0.01 M Sodium pyrophosphate). Filters were 

10 then placed in scintillation vials and 1 ml scintillation counting liquid was added. 
Radioactivity was counted using a scintillation counter. 

Compounds inhibiting PolC subunit is identified by modifying the above reaction 
to include only the PolC subunit and using 2.5 pg activated calf thymus DNA as a 
substrate, instead of singly-primed M1 3mp1 8 DNA, as previously described. 

15 Several techniques are utilized to determine the interaction of inhibitors with 

individual subunits. These have been described in the literature and include the 
following: (1) Nuclear magnetic resonance and capillary electrophoresis. 

20 

Example 14 
Era GTPase in Alloiococcus otitipis 

The era (E. coli Ras) gene was initially identified while sequencing around the 
25 mc gene; era lies downstream of rnc. While a function for era has yet to be 

determined, conditional (temperature sensitive) mutants revealed that the product of 
the era gene, Era, is essential for E. coli viability. A hint as to an in vivo function for 
Era was uncovered when a suppressor of a dnaG (primase) allele was found to map 
in the era coding sequence and a second suppressor, which mapped upstream of the 
30 era open reading frame, affected expression of era. These data suggest that Era 
could play one or more roles in DNA replication, regulation of primase activity or 
otherwise effect cell cycle progression. More recent data has confirmed that the era 1 
mutant causes a defect in cell growth at the two-cell stage and delays cell division 
Moreover, Britton et al demonstrated that cell division was coupled with the level of 
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Era in the cell: division arrest, through reduction in Era levels, is reversed when Era 
levels return to threshold amount. A current model suggests that Era acts as a 
checkpoint regulator in the bacterial cell cycle. Era is a GTP-binding protein with 
GTPase activity, a threshold level of functional/activated Era may be required to 

5 initiate septation. 

Era is associated with additional cellular functions, specifically translation, as 
Era specifically interacts with the translation machinery. E. coli Era binds both 1 6S 
rRNA and the 30S ribosomal subunit; whereas, the S. pneumoniae 16S rRNA co- 
purifies with Era. A putative RNA binding "KH motif has been identified in the 

10 carboxyl-terminal domain. The RNA binding activity is critical to Era cellular function 
as mutation of the putative RNA binding region of the S. pneumoniae Era prevents 
complementation of an E. cofi era mutant strain. Homologue of this gene identified in 
Alloiococcus otitidis as described in Example 5 (Seq. ID No 65). The protein encoded 
by the gene is set forth in Seq. ID No. 66. 

15 

Nucleotide binding 

Filter-binding assays are utilized to demonstrate nucleotide-binding specific to 
GTP and not UTP, CTP or ATP. Both GTP and GDP (unlabeled) were capable of 
inhibiting a 32 P-GTP binding. The Kd for GTP and GDP binding were reported to be 

20 5.5 and 1 .0 pM, respectively. 

A large number of GTP-binding proteins have been studied and all members 
of the family contain three regions of highly homologous amino acid residues that 
define a GTP-binding pocket. Era contains well-conserved regions defining the so- 
called G1 (G/AXXXXGKT/S: residues 15-22), G3 (DXXG: residues 62-65) and G4 

25 (NKXD: residues 124-128) consensus sequences. The G2 domain (residues 33-38, 
see below), located between G1 and G3, is generally more variable. 

GTPase activity 

Purified Era showed a significant GTPase activity, which is inhibitable by GTP 
30 or GDP but not by UTP, CTP, ATP or ADP. The maximum hydrolysis rate is 

measured at 9.8 mmol GTP hydrolyzed/min/mol Era. The Km was found to be 9 pM. 

It should be noted that Sullivan et al demonstrated, using mant (JV-methyl-3'- 
O-anthraniloyl) labeled GTP and GDP, very rapid exchange kinetics for guanine 
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nucleotide binding. Era exchanges guanine nucleotides 10-fold more rapidly than the 
GTP hydrolysis rate suggesting that guanine nucleotide binding and release should 
be considered as a regulatory point in addition to the more well-studied hydrolysis 
step. 

5 

Autoph osphoryl ati on 

When y^P-GTP is used as a substrate for the GTPase activity , Era is 
phosphorylated. The autophosphorylation reaction is specific for GTP, as incubation 
with y^P-ATP did not result in phosphorylation of Era. Moreover, a 32 P-GTP is not a 

10 suitable substrate for detection of Era autophosphorylation. Tryptic digestion and 
HPLC were utilized to resolve the sites(s) of phosphorylation. Using y^P-GTP as a 
substrate the major radioactive peak contained the tryptic peptide, ISITSR, 
corresponding to Era residues 33-38 and containing 3 potential phosphorylation 
sites. Mutagenesis of both Thr-36 and Ser-37 to alanine abolished enzymatic 

15 activity. However, individual alanine substitutions at either site had no effect on Era 
function. The autophosphorylation site is located in the so-called G2 domain of Era. 

Suitability of target for anti-infective development 

Era is an essential protein for bacterial viability. Knock-down mutations as well 
20 as conditional-lethal alleles revealed that Era function is required for cytokinesis. An 
additional phenotype of the Era-depleted strains is an aberrant response to 
temperature induced stress. This target is novel and may well lead to the 
identification of new classes of anti-infectives. The widespread distribution of Era 
homologues in both gram-negative and gram-positive pathogens suggests that 
25 broad-spectrum agents could result from an effort to define Era inhibitory 
compounds. 

Assays for measuring Era function 

30 NUCLEOTIDE BINDING ASSAYS 

Era binding to nucleotide is monitored by a simple filter-binding assay. Era 
(1 -5 pg) is incubated with a^P-GTP (0.2 ^Ci) in a buffer consisting of 1 00 mM Tris 
(pH 7.5), 10 mM MgCI 2) 0.2% NP-40, 0.2 mg/ml BSA for 30 minutes at 32*C. A 
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portion of the reaction mix is spotted on nitrocellulose membrane, washed (50 mM 
-Tris (pH 7.5), 5 mM MgCI 2 , 1 mM DTT) and dried. The membrane is then exposed to 
X-ray film. Alternatively, the spots are excised and counted. This assay is directly 
amenable to HTS using filter plates. 

5 

GTPase activity Assay 

The GTP hydrolytic activity of Era is monitored using thin-layer 
chromatography. Era and cr^P-GTP is incubated in 50 mM Tris (pH 7.5), 5 mM 
MgCI2, 0.1 % NP-40, 0.2 mg/ml BSA for 30 minutes at 37°C. An aliquot of the 

10 reaction is placed on PEI cellulose and the strip developed with 0.5 M KH 2 PQ 4 , 1 .0 M 
NaCI (pH 3.7). The spots conforming to GDP and GTP are identified by UV 
shadowing, excised and counted. This assay represents an acceptable 
secondary/confirmatory assay. 

Alternatively, the hydrolysis of y^P-GTP is monitored by assaying for 

15 liberated Pj. Obg and a 32 P-GTP is incubated in 50 mM Tris (pH 8.5), 1 .5 mM MgCl 2 , 
0.1 mM EDTA, 100 mM KCI, 10% glycerol for 30 minutes to 3 hours at 37°C. The 
reaction will be stopped by the addition of a slurry of charcoal in 1 mM Kpi (pH 7.5), 
which selectively binds the GTP and GDP. The liberated Pj in the supernatant is 
monitored by Cerenkov counting. Free P s can also be monitored with the Malachite 

20 Green reagent. 

AUTOPHOSPHORYLATION ASSAY 

Era autophosphorylation is monitored by incubating Era with y^P-GTP in 50 mM 
morpholinopropane sulphate (pH 6.8), 5 mM MgCI2, 1 mM DTT at 37°C (14)." 
25 Samples are analyzed following separation oh SDS polyacrylamide gels, drying the 
gel and exposure to film. This assay represents an acceptable 
secondary/confirmatory assay for Era activity. 
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EXAMPLE 15 

FmhB(FemX) Genes in Alloiococcus Otitidis 

5 The femA, femB, and fmhB(femX) genes have been shown to be essential for 

incorporation of glycine into the side chain of peptidoglycan precursors in 
Staphylococcus aureus,. The femAB locus was initially identified as a factor essential 
for methicillin resistance {fern) based on random insertional inactivation of 
chromosomal genes and a screen for reduced expression of resistance mediated by 

10 the penicillin binding protein 2A (PBP2A). Inactivation of femA or femB was 

subsequently reported to prevent incorporation of glycine residues at positions 2 to 5 
or positions 4 to 5 of the penta-glycine cross bridge since muropeptides cross-linked 
by one or three glycine residues were detected in the corresponding mutants. 
Inactivation of fmhB, formerly femX, is lethal, but the construction of a mutant 

15 conditionally expressing fmhB under the control of a xylose-inducible promoter 
showed that the gene was essential for synthesis of branched peptidoglycan 
precursors . These studies show that the fern gene products were required for 
incorporation of glycine at positions 1 (FmhB), 2 and 3 (FemA), and 4 and 5 (FemB) 
of the cross bridge, although the catalytic activity of the proteins has not been directly 

20 assessed. Similarly, inactivation of two fmhB homologues in Streptococcus 

pneumoniae, designated murM(fibA) and murN(fibB), reduced addition of L-Ala or L- 
Ser to the -amino group of L-Lys and subsequent addition of a second L-Ala residue, 
respectively. Overall, disruption of the murMN operon reduced the proportion of 
branched peptide stems in the peptidoglycan from 89 to 33% . In contrast to what 

25 occurs in S. aureus, direct cross-linking of L-Lys to D-Ala occurs in S. pneumoniae, 
and the murMN operon was accordingly reported to be unessential. 

BLAST analysis of Alloiococcus otitis genome revealed an ORF similar to 
femXof Weissella viridescent , and fmhB of S. aureus. It suggests that in 
Alloiococcus otitis there is an enzyme with similar to FhmB function. Homologue of 

30 this gene identified in Alloiococcus otitidis is described in Example 5 /Table 4 (Seq. 
ID No 97). The protein encoded by the gene is set forth in Seq. ID No. 98. 
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Assays for measuring FmhB function 

There are no in vitro biochemical assays to test enzymatic activity of S. 
aureus FmhB because the reaction occurs at the membrane-bound lipid II precursor 
GlcNAc-(P-1 ,4)-/V- acetylmuramic acid(-L-Ala-D-iGln-L-Lys-D-Ala-D-Aia)- 

5 pyrophosphoryi-undecaprenol. 

Lipid II is a minor component of bacterial cell membrane which is detected by 
thin-layer chromatography separation of presolubilized membranes supplied with the 
cytoplasmic precursors, UDP-N-acetylmuramyl-pentapeptide (UDP-MurNAc- 
pentapeptide) and [ 14 C]UDP-/S/-acetylglucosamine ([ 14 C]UDP-GlcNAc). 

10 The in vitro biosynthesis of branched lipid II of S. aureus requires whole-cell 

membranes, cytoplasmic PG precursors, glycine ( 14 C labeled for detection of reaction 
products), purified tRNA, and an intracellular fraction that contains tRNA-activating 
enzymes. Therefore, the in vitro assay of S. aureus FmhB is a tedious procedure. 
One way to facilitate this procedure is to use Weisselia viridescensFemX or 

15 E. faecalis UDP-MurNac-pentapetide:L-alanine ligase. Recombinant Weisseila 
viridescensFemX and E. faecalis UDP-MurNac-pentapetide:L-alanine ligase were 
purified, and their in vitro activity was demonstrated. The distinctive feature of these 
enzymes is that they catalyze the addition of a branching amino acid (Ala) to the 
cytoplasmic cell wall precursor UDP-MurNac-pentapetide. 

20 Other bacteria for which the biosynthesis of Gly-containing branched UDP- 

MurNac-hexapeptide in cytoplasm was shown are Streptomyces lividans and 
Streptomyces hydroscopicus , although the enzymes were not isolated and their 
ligase activity remain to be demonstrated. 

These new data open an opportunity to develop an assay to detect the 

25 activity of FmhB(FemX) by using cytoplasmic UDP-MurNac-pentapetide. 

Products of the reaction are detected by HPLC. HPLC separation of precursors are 
performed by the method of Flouret et al. The precursors are separated by reverse- 
phase HPLC on a/yBondapak Ci8 column (3.9 by 300 mm; Waters) in 50 mM 
ammonium formate (pH 3.9) at a flow rate of 0.5 ml/min. The elution of precursors is 

30 monitored at a wavelength of 254 nm. 
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Example 16 
FolA- Dihydrofolate reductase (DHFR) 

5 The Alloiococcus ORF-1 863 encodes a homolog of S. aureus dihydrofolate 

reductase that catalyzes the NADPH-dependent conversion of dihydrofolate to 
tetrahydrofolate, one of the steps in bacterial folate biosynthesis. Homologue of this 
gene identified in Alloiococcus otitidis is described in Example 5/Tabie 4 (Seq. ID No 
55). The protein encoded by the gene is set forth in Seq. ID No. 56. 

10 

FolA as a target for anti-infective development 

Folate is an essential cofactor in many important metabolic processes in bacteria, 
such as purine, pyrimidine, amino acid and pantothenate biosynthesis. Unlike 
mammalian cells, bacteria are unable to utilize exogenous folate derivatives, and 

15 therefore must synthesize folate de novo. Bacterial folate biosynthesis occurs via two 
converging pathways, the non-essential para-amino-benzoate (PABA) synthesis 
pathway, and synthesis of the pterin precursor, to which pABA is subsequently 
attached to form the folate precursor. Bacterial DHFRs are essential for viability and 
well conserved across all bacterial species. Although bacterial DHFR shares 

20 similarity with human DHFR, selective inhibitors against bacterial DHFR have been 
identified in the past such as trimethoprim which specifically blocks the bacterial 
DHFR step. Thus DHFR still remains an attractive target for development of broad- 
spectrum antibacterial agents. 

25 Assays for measuring DHFR activity 

DHFR activity is monitored spectrophotometrically, recording the change of 
absorbance at 340 nm due to the equimolar consumption of NADPH in the course of 
dihydrofolate substrate reduction. DHFR (10 ng) is preincubated in reaction buffer 
containing 50 mM 2-(N-morpholino)ethanesulfonic acid, 25 mM Tris-HCI, 25 mM 

30 ethanolamine, and 1 00 mM NaCI at pH 7.5 for 3 minutes. The reaction is started by 
addition of 0.5-10 jmM 7,8-dihydrofolate. The amount of processed substrate is 
calculated from the decrease of absorbance at 340 nm due to oxidation of NADPH 
(□=1 1800 M* 1 cm" 1 ) to NADP + . 



-97- 



WO 03/104391 



PCTAJS02/36122 



EXAMPLE 17 

FolB- Dihyproneopterin aldolase (DHNA) 

5 The Alloiococcus otitidis ORF-959 encodes a homolog of S. aureus 

dihyclroneopterin aldolase that catalyzes the conversion of 7,8-dihydroneopterin to 6- 
hydroxymethyl-7,8-dihydropterin, one of the early steps in bacterial folate 
biosynthesis. Homologue of this gene identified in Alloiococcus otitidis is described in 
Example 5/Table 4 (Seq. ID No 31). The protein encoded by the gene is set forth in 

10 Seq. ID No. 32. 

FolB as a target for anti-infective development 

Folate is an essential cofactor in many important metabolic processes in 
bacteria, such as purine, pyrimidine, amino acid and pantothenate biosynthesis. 

15 Unlike mammalian cells, bacteria are unable to utilize exogenous folate derivatives, 
and therefore must synthesize folate de novo. Bacterial folate biosynthesis occurs via 
two converging pathways, the non-essential para-amino-benzoate (pABA) synthesis 
pathway, and synthesis of the pterin precursor, to which pABA is subsequently 
attached to form the folate precursor. Enzymes that catalyze steps in the folate 

20 biosynthesis pathway are essential and well conserved across all bacterial species, 
and those that act in early steps such as FolB have no direct homologs in mammals. 
Thus FolB becomes an attractive target for development of broad-spectrum 
antibacterial agents. 

25 Assays for measuring FolB activity 

FolB (DHNA) 7,8-dihydroneopterin aldolase activity is monitored individually 
or in conjunction with downstream enzymes in folic acid biosynthesis pathway (FolK 
and Sul). 

FolB activity is monitored directly by HPLC assay. FolB substrate (7,8- 
30 dihydro-D-neopterin) is commercially available from Schircks Laboratories 

(Swizerland). FolB (0.5 \xg) is preincubated in reaction buffer containing 50 mM Tris- 
HCI (pH 8.0), 50 mM KCI, 0.1 mg/ml BSA, 2.5 mM dithiothrietol for 5 min. Reaction 
is started by addition of stock solution of 7,8-dihydro-D-neopterin in DMSO (1 00 p-M 

-98- 



WO 03/104391 



PCTYUS02/3(U22 



final concentration). Reaction is terminated by addition of 1/3 of reaction volume of 
1 % l 2) 2% Kl in 1 M HCI with subsequent incubation at room temperature for 5 
minutes. Quenched reaction will be applied directly to HPLC. Oxidized starting 
material and reaction products are efficiently separated on ODS (C18) column. 

5 Reaction components are detected and quantified by analysis of UV absorbance at 
254 nm, or fluorescence (excitation at 365 nm; emission at 446 nm). 

FolB activity are also monitored in the coupled assay with FolK (HPPK) and Sul 
(DHPS) enzymes. FolB activity is measured by detection of radioactive 
dihydropteroate formation as described in FolK and Sul assays, under conditions of 

10 excess of the later enzymes. FolB enzyme and substrate 7,8-dihydro-D-neopterin 
are added to the described assay to replace the 6-hydroxymethyl-7,8-dihydropterin 
(FolK substrate). 

Example 18 

15 FolC- Dihydrofolate synthase (PHFS) 

The Alloiococcus otitidis ORF-956 and ORF-528 both encode a homolog of B. 
subtilis dihydrofolate synthase that catalyzes the conversion of 7,8-dihydropteroate 
and glutamate to dihydrofolate, one of the steps in bacterial folate biosynthesis [. 
20 Homologue of this gene identified in Alloiococcus otitidis as described in Example 5 
(Seq. ID Nos. 29 and 23). The protein encoded by the gene is set forth in Seq. ID 
Nos. 30 and 24. 

Use of FolC as a target for anti-infective development 

25 Folate is an essential cofactor in many important metabolic processes in bacteria, 

such as purine, pyrimidine, amino acid and pantothenate biosynthesis. Unlike 
mammalian cells, bacteria are unable to utilize exogenous folate derivatives, and 
therefore must synthesize folate de novo. Bacterial folate biosynthesis occurs via two 
converging pathways, the non-essential para-amino-benzoate (pABA) synthesis 

30 pathway, and synthesis of the pterin precursor, to which pABA is subsequently 
attached to form the folate precursor. Enzymes that catalyze steps in the folate 
biosynthesis pathway are essential, and are well conserved across all bacterial 
species. Bacterial FolC appears to be a bifunctional enzyme possessing both 
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dihydrofolate synthase (DHFS) activity and folyl-polyglutamate synthetase (FPGS) 
activity, which are probably mediated through different sites of the protein. The 
bacterial DHFS activity but not the FPGS activity is essential for viability. Although 
bacterial FolC shares similarity with human FPGS, the human enzymes apparently 
5 lack DHFS activity and display a folate substrate specificity quite distinct from that of 
bacteria! enzymes. Thus targeting bacterial FolC/DHFS activity selectively might lead 
to identification of broad-spectrum antibacterial agents. 

Assays for measuring FolC activity 
10 FolC (DHFS) 7,8-dihydrofolate synthase activity in the presence or absence 

of antimicrobial compounds or putative inhibitory compounds are monitored by 
several methods. 

In one method, FolC activity is monitored directly by simple HPLC assay. 
FolC substrate (7,8-dihydropteroic acid) is commercially available form Schircks 

15 Laboratories (Switzerland). FolC (15 ng) is added to reaction mix, containing 10 mM 
glutamate, 5 mM ATP, 50 mM Tris-HCI (pH 8.0), 20 mM Mg 2 CI, 50 mM KCI, 0.1 
mg/ml BSA, 5 mM dithiothreitol. Reaction is started by addition of stock solution of 
7,8-dihydropteroic acid in DMSO (10 piM final concentration). Reaction is terminated 
by addition of equal volume of 8M Guanidinium hydrochloride. Stopped reaction is 

20 applied directly to HPLC. Starting material and reaction products are efficiently 

separated on ODS (C1 8) column. Reaction components are detected and quantified 
by analysis of UV absorbance at 254 nm, or fluorescence (excitation at 280 nm; 
emission at 420 nm). 

In another method, the FolC activity monitoring is by detection of ADP 

25 accumulation. ADP is released in the amount equimolar to the amount of the product 
formed. ADP detection is performed by coupling its conversion to ATP by pyruvate 
kinase in the presence of phospho(enol)pyruvate producing pyruvate. Lactate 
dehydrogenase reduces pyruvate to S-lactate in the presence of NADH. Course of 
reaction is monitored by decrease in absorbance at 340 nm due to oxidation of 

30 NADH (8=6220 cm' 1 M' 1 ) to NAD + . Reaction conditions are as following: 5 mM 
dithiothreitol, 5 mM ATP, 380 ^iM NADH, 10 mM glutamate, 2 mM 
phospho(enol)pyruvate, 50 mM KCI, 20 mM Mg 2 CI, 50 mM Tris-HCI, 50 fig of 
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pyruvate kinase, 50 \ig of S-lactate dehydrogenase. Reaction is started by addition 
7,8-dihydropteroic acid in DMSO (10 jiM final concentration). 

In yet another method, FolC activity is monitored through detection of 
inorganic phospate release. Amount of inorganic phosphate in solution is quantified 
5 by: 

(i) its conversion by purinenucleoside phosphorylase leading to 
phosphorylation of MESG. Later assay kit is available from Molecular Probes 
as EnzCheck™ Phosphate Assay Kit; 

(ii) its reaction with Malachite Green reagent; and 

10 (iii) detecting the release of radioactive inorganic phosphate in reaction with y- 

33 P-labeled ATP following the absorption of unprocessed ATP by charcoal. 

First method is applied in rate-based assay format; the later two in 
end-point format. Reaction conditions are similar to the ones described in 
HPLG-based assay. 

15 

Example 19 

folk- 6-hyproxymethyl-7, 8-dlhypropterin pyrophosphok1nase f hppk) 

The Alloiococcus otitidis OFR-961 (Seq. ID No. 33) encodes a homolog of S. 
20 aureus 6-hydroxymethy!-7,8-dihydropterin pyrophosphokinase that catalyzes 

pyrophosphoryl transfer from ATP to 6-hydroxymethyl-7,8-dihydropterin, one of the 
early steps in bacterial folate biosynthesis. The protein encoded by this ORF is set 
forth in Seq. ID No. 34. (see Example 5/Table 4). 

25 Use of FolK as a target for anti-infective development 

Folate is an essential cofactor in many important metabolic processes in 
bacteria, such as purine, pyrimidine, amino acid and pantothenate biosynthesis. 
Unlike mammalian ceils, bacteria are unable to utilize exogenous folate derivatives, 
and therefore must synthesize folate de novo. Bacterial folate biosynthesis occurs via 
30 two converging pathways, the non-essential para-amino-benzoate (pABA) synthesis 
pathway, and synthesis of the pterin precursor, to which pABA is subsequently 
attached to form the folate precursor. Enzymes that catalyze steps in the folate 
biosynthesis pathway are essential and well conserved across all bacterial species, 
and those that act in early steps such as FolK have no direct homologs in mammals. 
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Thus FolK is an attractive target for the development of broad-spectrum antibacterial 
agents. 

Assays for measuring FolK activity 

5 

FolK (HPPK) 7,8-dihydroxymethylpterin-pyrophosphokinase activity is 
monitored individually or in conjunction with downstream enzyme in folic acid 
biosynthesis pathway. 

FolK activity is monitored directly by HPLC assay. FolK substrate (7,8- 

10 dihydro-6-hydroxymethylpterin) is commercially available from Schircks Laboratories 
(Swizerland). FolK is preincubated in reaction buffer containing 50 mM Tris-HCI (pH 
8.0), 50 mM KCI, 20 mM MgCI 2 , 5 mM ATP, 0.1 mg/ml BSA, 2.5 mM dithiothrietol. 
Reaction is started by addition of stock solution of 7,8-dihydro-6-hydroxymethylpterin 
in DMSO (1 00 p.M final concentration). Reaction is terminated by addition of equal 

15 . volume of 8M Guanidinium hydrochloride and applied directly on HPLC. Starting 
material and reaction products are efficiently separated on ODS (C18) column. 
Reaction components are detected and quantified by analysis of UV absorbance at 
254 nm. 

FolK activity is monitored by end-point assay coupled with excess of Sul enzyme. 
20 Activity is calculated from quantification of the radioactivity incorporated in final 
product (7,8-dihydropteroate). 

Example 20 

alloiococcus otitidis encoded folp (sul)- dlhydrqpteroate synthase ( dhps) 

25 

The Alloiococcus otitidis ORF-1 81 1 (Seq. ID No. 53) encodes a homolog of B. 
subtifis dihydropteroate synthase that catalyzes the condensation of pABA (para- 
aminobenzoic acid) with 6-hydroxymethyl-7,8-dihydropterin pyrophosphate, one of 
the early steps in bacterial folate biosynthesis. The polypeptide encoded by this ORF 
30 is set forth in Seq. ID No. 54. (see Example 5/Table 4) 

FolP as a target for anti-infective development 

Folate is an essential cofactor in many important metabolic processes in bacteria, 
such as purine, pyrimidine, amino acid and pantothenate biosynthesis. Unlike 
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mammalian cells, bacteria are unable to utilize exogenous folate derivatives, and 
therefore must synthesize folate de novo. Bacterial folate biosynthesis occurs via two 
converging pathways, the non-essential para-amino-benzoate (pABA) synthesis 
pathway, and synthesis of the pterin precursor, to which pABA is subsequently 

5 attached to form the folate precursor. Enzymes that catalyze steps in the folate 
biosynthesis pathway are essential and well conserved across all bacterial species, 
and those that act in early steps such as FolP (Sul) have no direct homologs in 
mammals. In fact, dihydropteroate synthase (FolP or Sul) is the target for known 
antibiotics sulfonamides which are competitive inhibitors of FolP/Sul as pABA 

10 analogues. Thus FolP (Sul) still remains an attractive target for development of 
broad-spectrum antibacterial agents. 

Suitable assays for measuring FolP/Sul activity 

Sul (DHPS) 6-hydroxymethy-7 f 8-dihydroneopteroate synthase activity is 

15 monitored individually or in conjunction with upstream enzymes in folic acid 
biosynthesis pathway (FolB and/or FolK). 

DHPS activity is monitored directly by counting the amount of radioactivity 
incorporated in 6-hydroxymethy-7,8-dihydroneopteroate when using radioactively 
labeled p-aminobenzoic acid (pABA). Final product is separated from unreacted 

20 pABA by thinlayer chromatography, paper chromatography or on HPLC equipped 
with radioactivity detector. DHPS substrate (6-hydroxymethy!-7,8-dihydropterin 
pyrophosphate) is not commercially available, but is quantitatively synthesized in one 
step from its oxidized precursor available from Schircks Laboratories (Swizerland). 
DHPS (20 ng) is added in reaction buffer containing 50 mM Tris-HCl, pH 8.0, 20 mM 

25 MgCI 2 , 0.1 mg/ml BSA, 5 mM dithiothreitol and 0.5-10 p.M PABA. Reaction is 

started by addition of stock solution of substrate (6-hydroxymethyl-7, 8-dihydropterin 
pyrophosphate, 0.05 - 1 p.M final concentration). Reaction is terminated by 
acidification of reaction volume with addition of equal volume of citrate/phosphate or 
ammonium acetate/acetate buffer, pH 4 containing excess of uniabelled pABA. 

30 Quenched reaction is separated by chromatography and the amount of formed 
product calculated. 

DHPS activity is determined in coupled assay with excess of FolB and FolK 
enzymes. The advantage of coupled assay is that it makes it possible to use 
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commercially available FolB (7,8-dihydro-D-neopterin), or FoIK (6-hydroxymethyl-7,8- 
dihydropterin) substrates, thus forming DHPS substrate In situ. 



Example 21 

ALLOIOCOCCUS OTITIDtS ENCODED Fl LAMENTATION TEMPERATURE SENSITIVE GENE A 

(FtsA) 



The Alloiococcus otitidis ORF-2489 (Seq. ID No. 85) encodes a homolog of E. 

10 faecalis FtsA, one of the essential components of bacteria! cell division. The "fts" 
stands for f ilamentation temperature sensitive and has been assigned to most 
bacterial cell division genes due to the fact that these genes were generally 
discovered by the isolation of conditional mutants that form filaments at 
nonpermissive temperature . The ftsA allele was first isolated and identified in E. coli 

15 by Ricard and Hirota in 1 973, and mapped along with ftsZ in 1 980.The protein 
encoded by this ORF is set forth in Seq. ID No. 86. (see Example Stable 4) 

Bacterial cell division requires formation of a septum at mid-cell that begins 
with the polymerization of FtsZ into a ring structure at the nascent division site. FtsZ, 
another key component of bacterial septation is the first known protein to localize to 

20 the division site. In E. co//, shortly after the formation of the FtsZ ring, FtsA and ZipA 
(another key division component present only in gram-negative bacteria) [7] are 
independently recruited to the septal ring, most likely through their direct interaction 
with FtsZ. Subsequent assembly of other division components at the septum requires 
FtsA as well as FtsZ. 

25 

FtsA as a target for anti-Infective development 

Like FtsZ, FtsA homologs are present and highly conserved in almost all 
eubacteria. FtsA is essential for cell division and its deletion leads to impaired cell 
division and sporulation defect. In addition, E, coli cells have to maintain critical ratio 
30 of FtsA to FtsZ in order for proper cell division to occur. FtsA belongs to the 

actin/DnaK/sugar kinase family of proteins. In B. subtilis, FtsA acting as a dimer not 
only binds ATP but also hydrolyzes ATP. As briefly stated above, in vivo and in vitro 
evidence have demonstrated that FtsA and FtsZ from various bacterial species 
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directly interact. Taken all together, targeting at FtsA especially at its interaction with 
FtsZ might lead to identification of broad-spectrum antibacterial agents. 

Assays for measuring FtsA activity 

5 

ATPase activity of FtsA is assayed by following the formation of ^Pi from [y-^P]- 
ATP. The reaction mixture containing 50 mM Tris-HCl (pH7.2), 50 mM potassium 
acetate, 1 mM DTT, 1 0 mM MgCI 2 and different concentrations of [y-^-ATP is 
incubated for 5 minutes at 37°C. The reaction is started by addition of 50 nM purified 

10 FtsA of Alloiococcus. The reaction is stopped with 1 .5% ammonium molybdate in 
0.5N sulfuric acid, and the radioactive Pi extracted into isoamyl alcohol and counted. 

Interaction between FtsA and FtsZ is detected quantitatively using yeast two- 
hybrid system as described. Briefly, Alloiococcus ftsZls cloned into yeast two-hybrid 
bait vector pLexA (Clontech) to generate a LexA-FtsZ fusion with DNA-binding 

15 property. Alloiococcus ftsA is cloned into the target vector pB42AD (Clontech) to 
fuse FtsA to the activating domain. Both plasmids are then transformed into a 
Saccharomycyces cerevisiae strain containing a lacZ reporter under the control of 
multiple LexA operators. P-Galactosidase activity is determined to quantify relative 
strength of FtsA-FtsZ interaction. 

20 

Example 21 

alloiococcus otitidis encoded fl lamentation temperature s ensitive gene z 

(FTSZ) 

25 FtsZ is an essential protein that forms a cytokinetic ring (Z-ring) that drives 

cell division in bacteria. FtsZ has been identified in most prokaryotic species with the 
exception of Chlamidia, a Ureaplasma species and Crenarchaea. FtsZ and Z-ring 
formation are most extensively studied in E. coll FtsZ is an abundant cytoplasmic 
protein which is present at - 10 4 copies per cell, and is the first protein to be localized 

30 to the division site. Z-ring is required throughout septation and directs the ingrowth of 
septum in part by recruiting other cell division protein to the division site. Another 
function is suggested by FtsZ homology to eukaryotic tubulins. Like tubulin, FtsZ is a 
GTPase and undergoes GTP/GDP-dependent polymerization. Recent studies 
showed that Z-ring is a very dynamic structure suggesting that GTP-dependent 
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assembly/disassembly of Z-ring might provide constriction force to power cell 
division. Homologue of this gene identified in Alloiococcus otitidis is described in 
Example Stable 4 (Seq. ID No 83). The protein encoded by the gene is set forth in 
Seq. ID No. 84. 

5 . 

GTPase activity 

FtsZ is a GTPase that contains the tubulin-signature nucleotide-binding motif 
GGGTGS/TG. Like in □□□-tubulin dimer, the active site for GTP-hydrolysis appears 
to be shared between two subunits where the GTP-binding pocket is provided by one 
10 subunit while the GTPase-activating T7 loop comes from the other subunit This view 
is supported by genetic analysis as various mutations that inhibit FtsZ GTPase 
activity map in the T7-loop region and a conserved Asp-residue in T7-loop is found to 
be involved in the coordination of the cation involved in GTP hydrolysis. FtsZ 
GTPase activity is Mg 2+ -dependent and is stimulated by KCI. 

15 

Polymerization 

In vivo, about 75% of FtsZ is present as multimers. In vitro, FtsZ forms a 
variety of structures at various conditions. FtsZ assembles into thin protofilaments 
with GTP and formation of FtsZ polymers is coupled to GTP hydrolysis: when GTP 
20 runs out, polymers disassemble. Protofilaments assemble into sheets and bundles in 
the presence of multimolar amounts of either Mg 2+ or Ca 2+ or by addition of DEAE- 
dextran. In addition, ZipA protein induces bundling of FtsZ polymers. With GDP, FtsZ 
assembles into curved filaments and minirings. 

25 Interactions with other proteins 

In E. coi'u at least nine different proteins are localized to the division septum and 
are required for cell division to proceed. Among them two proteins, ZipA and FtsA, 
are shown to interact directly with FtsZ. Both of these proteins localize to the division 
site independently from each other, but require FtsZ for localization. ZipA is an 
30 integral membrane protein which is thought to mediate invagination of cell membrane 
by linking the membrane to constricting Z-ring. Interaction between ZipA and FtsZ is 
confined to C-terminal portion of ZipA (residues 1 85-328) and conserved 17-amino 
acid region on C-terminus of FtsZ. FtsA is an actin-like membrane-associated protein 
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which possesses ATPase activity and might provide energy required for 2-ring 
dynamics. Interaction between FtsZ and FtsA is not studied in great detail, it is shown 
that C-terminus of FtsZ is required. The remaining division proteins require both ZipA 
and FtsA for their localization to Z-ring. 

5 

FtsZ as a target for anti-infective development 

FtsZ is an essential protein for cell division/bacterial viability. Knock-out ftsZ 
mutants fail to divide and, as a result, filament and die. The target is widely 

10 conserved throughout bacterial kingdom implying that FtsZ-specific inhibitor would 
have a broad-spectrum antibacterial activity. The potential drawbacks of the target 
might include the presence and the essential role of a homolog (tubulin) in 
eukaryotes and an intrinsic difficulty in inhibiting protein-protein interactions by small 
molecules. Although this target is being studied extensively, no FtsZ-specific 

15 compounds are reported up to date. 

Assays for measuring FtsZ function 

Polymerization of FtsZ is measured by light scattering assay as described 
previously. FtsZ (12.5 pM) is incubated in 200 pi of polymerization buffer (50 mM 

20 MES/NaOH, pH 6.5, 50 mM KCi, 5 mM MgCI 2 , 10 mM CaCl 2 ) in a fluorescence 

cuvette with a 1 cm path length. The sample is maintained at 30°C, polymerization is 
induced by addition of 20-500 pM GTP. Light scattering is measured at 90°, both 
excitation and emission wavelengths are set to 350 nm, slit width is 2 nm. 
Alternatively, the amount of polymerized FtsZ is analyzed by sedimentation and 

25 subsequent quantification of precipitated FtsZ by SDS-PAGE, Coomassie staining 
and densitometric scanning. In addition, polymers are observed by electron 
microscopy. This assay represents either primary or secondary/confirmatory assay. 

GTP binding of FtsZ is monitored by the covalent cross-linking of [y-^PJGTP 
(3000 Ci/mmol) to FtsZ in a previously described competition assay. FtsZ (3 pg) is 

30 incubated in 20 pi of 50 mM MES/NaOH, pH 6.5, 100 mM KCI, 4 mM MgCI 2 , 1 mM 
EDTA, 0.1 mM EGTA and 0.5 mM DTT. Various amounts of non-labeled competing 
nucleotide (GTP or GTP analogs) and 0.1 mM [y- 32 P]GTP are added, samples are 
incubated at 0°C for 15 min, then UV cross-linked for 5 min and analyzed by SDS- 
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PAGE on 12% gel, autoradiography and densitometric scanning. This assay 
represents a secondary/confirmatory assay. 

The GTP hydrolytic activity of FtsZ is monitored by thin-layer chromatography 
(TLC) as described previously. Briefly, the reaction mixture consists of 5 mM of [y- 

5 ^GTP (40 mCi/mmol), 1 5 mM magnesium acetate and 0.25-2 mg/ml of FtsZ in 
reaction buffer (40 mM Tris-acetate, pH7, 200 mM potassium acetate, 2 mM EDTA, 1 
mM DTT and 0.5% Triton X-100), aliquots are separated by TLC and amount of GTP 
converted to GDP is determined by spot-densitometry. Alternatively, GTPase activity 
is measured either by quantitation of the non-radioactive inorganic phosphate with 

10 the malachite green-molybdate reagent as described previously or by quantitation by 
scintillation counting of radioactive inorganic phosphate released after hydrolysis of 
[Y-^PJGTP (26). This assay represents either primary or secondary/confirmatory 
assay. 

Among interactions of FtsZ with various cell division proteins, interaction 
15 between FtsZ and ZipA is characterized the best. ZipA -induced bundling of FtsZ ts 
measured by the light scattering assay that is described above, both proteins are 
used at s5 pM. 

Example 22 

20 ALLOIOCOCCUS OT1TIDIS ENCODED GYRA/GYRB (DNA GYRASE, TOPOISOMERASE II) 

AND GRLA/GRLB (TOPOISOMERASE IV) 

DNA topoisomerases: topoisomerases modulate the topological state of DNA 
in cells. This involves binding to DNA, introducing single or double stranded breaks 

25 in the DNA, passing DNA molecules through the break and rejoining the break. This 
controls the levels of positive and negative supercoiling of DNA and functions in 
catenation/decatenation. Controlling the topological state of DNA is critical to the 
fundamental processes of transcription, recombination, replication and partitioning of 
the chromosome. There are two main categories of topoisomerases, type I and type 

30 II. Type I topoisomerases introduce single stranded breaks in DNA whereas type II 
enzymes introduce double stranded breaks. GyrA/GyrB (gyrase) and GrIA/GrIB 
(topoisomerase IV) are both type II enzymes that are essential for cell viability. 

DNA gyrase (GyrA/GyrB) is a type II topoisomerase that functions to control 
the degree of supercoiling in double stranded DNA. It is essential for viability and 
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plays central roles in replication, repair, recombination and transcription of DNA. 
Gyrases have the ability to introduce double stranded breaks in DNA molecules while 
remaining bound to the DNA through phosphotyrosine bonds, pass uncut DNA 
through the break and then rejoin the breaks, with repeated cycles being driven by 
5 the hydrolysis of ATP. Gyrase has the unique ability to introduce negative supercoils 
in closed circular DNA and also functions to catenate/decatenate DNA duplexes. 
The generation of negative supercoiling is important for initial stages in replication. 
DNA gyrase from Escherichia coii has been studied in detail. It is a complex of two 
subunits of GyrA (encoded by gyrA) and two subunits of GyrB (encoded by gyrB) (ie. 
10 A 2 B 2 complex). The subunits are organized in discreet domains. An N-terminal 
domain of GyrB harbors ATPase activity while the C-terminal domain is thought to 
interact with the GyrA subunit, and is involved in DNA binding. The N-terminal 
domain of GyrA is apparently involved in DNA strand breakage-ligation reactions 
while the C-terminal segment is involved in DNA binding. Crystal structures of the 
15 DNA strand breakage/reunion domain of E. coii GyrA, and the N-terminal ATPase 
domain of E. coii GyrB have been determined. DNA gyrase has also been purified 
and characterized from gram positive organisms such as S. aureus. Comparison of 
DNA gyrases from several bacteria reveal a high degree of conservation of important 
domains. 

20 Topoisomerase IV (GrlA/GrlB) is a type II topoisomerase but unlike gyrase it 

does not possess negative supercoiling activity. Its primary role in replication 
appears to be in the decatenation of multiply linked daughter chromosomes, 
important for terminal stages of the replication process. Topoisomerase IV has been 
purified and characterized from gram negatives eg. E. coii, (where the GrlA/GrlB 

25 subunit homologs are designated ParC and ParE), and gram positives eg S. aureus. 
Homologs of thse gene identified in Aiioiococcus otitidis is described in Example 
5/Tabie 4 (Seq. ID Nos 17 and 19). The proteins encoded by the genes are set forth 
in Seq. ID Nos. 1 8 and 20. 
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GyrA/GyrB (Gyrase) and GrlA/GrlB (topoisomerase IV) as targets for anti- 
infective development: 

Alloiococcus otitidis is an infectious organism associated with disease, and 

5 consequently, novel antimicrobials to combat these infections are desirable. DNA 
gyrase and Topoisomerase IV is essential for bacterial viability and is a well- 
established and validated antibacterial target. 

Purification of DNA gyrase and topoisomerase IV from Alloiococcus 

10 otitidis 

Genes encoding the GyrA/GyrB and GrlA/GrlB subunits or their functional 
domains are obtained using polymerase chain reaction amplification of the genomic 
region encoding them. The genes are then subcloned into standard expression 
vectors, with or without affinity tags. The enzyme is then overexpressed in 
15 Escherichia coli and purified using a standard tag system or conventional 
chromatography. 

Measurement of gyrase and topoisomerase IV by kinetoplast DNA 
decatenation assay: 
20 Type 11 topoisomerases introduce double stranded breaks in DNA and 

mediate catenation/decatenation of DNA. Topoisomerase IV activity is readily 

determined with decatenation assays using as substrate kinetoplast DNA (KDNA) 

from Crithidia fasciculata. The DNA isolated in this procedure is a highly networked 

series of catenated double stranded minicircles and is easily be pelleted by 

25 centrifugation. The activity of topoisomerase II enzymes results in the release of 

decatenated DNA minicircles from the networked KDNA. These have a high mobility 
in agarose gels and migrate into the gel ahead of the networked material, which has 
very low mobility, allowing for determination of decatenation activity using ethidium 
bromide stained agarose gel electrophoresis. 

30 Alternatively, using radiolabeled KDNA, the level of decatenation activity is 

measured by counting radioactivity remaining in reaction supernatants following 
centrifugation to pellet the networked material. Typical conditions used for assaying 
decatenation activity of S. aureus and E. coli topoisomerase IV activity are as follows: 
C. fasciculata KDNA (0.9 mg/ml) is incubated in 40 pi of reaction buffer (50 mM Tris- 

35 HC1, pH 7.7, 5 mM MgCI 2 , 5 mM DTT, 50 pg/ml bovine serum albumin, 1 .5 mM ATP 
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and 350 mM potassium glutamate) with appropriate amounts of the Grl subunits, for 
1 hour at 37° C. If non radiolabeled KDNA is used, these reactions can be stopped 
and analyzed by agarose gel electrophoresis, or for radioassays, the reaction is 
stopped by gentle mixing with 10 pi of stop solution (50 % glycerol, 50 mM EDTA (pH 

5 8.0), 2.5 % SDS and 0.1 % bromphenyl blue) and centrifuged at 15 000 x g for 5 min 
at 20° C. Decatenation activity is determined by counting radioactivity in 25 pi of the 
supernatant in a scintillation counter. Alternatively, a modified assay employing flow 
injection fluorometry of 4', 6-diaminidino-2-phenylindole (DAPI) treated supernatants 
has been described that could be suitable for moderate throughput non radioactive 

10 assays, or filtration of the reactions through appropriate filters may efficiently 
separate the decatenated species from KDNA. Although the above described 
assays were used for topoisomerase IV, modified decatenation reactions using 
KDNA isolated from Leishmania donovani reveal significant decatenation activity by 
gyrase from E. coli and Mycobacterium smegmatis, indicating the applicability of the 

15 assay to prokaryotic gyrases. 

DNA Supercoiling/relaxation assays. 

DNA gyrase function is directly assayed using a simple supercoiling assay 
typified by that described for the measurement of Escherichia coli DNA gyrase 

20 activity. Briefly, incubation of relaxed closed circular plasmid DNA (pUCl 8, 7.5 nM) 
in the presence of DNA gyrase (approximately 10 nM) in 40 mM Tris-HCI (pH 8.0) 
buffer containing 25 mM KCI, 4 mM MgCI2, 2.5 mM spermidine and 1 .4 mM ATP 
buffer results in the introduction of supercoils in the plasmid DNA. Changes in DNA 
supercoiling status are readily observed by the alteration of mobility of the DNA in 

25 agarose gels stained with ethidium bromide and comparison to the mobility of relaxed 
and supercoiied plasmid template. This strategy is employed for screening for DNA 
gyrase inhibitors. 

Topoisomerase IV activity is assayed by measuring relaxation of supercoiied 
plasmid DNA. A typical relaxation assay used for S. aureus topoisomerase IV 
30 activity is as follows: topoisomerase IV enzyme and supercoiied plasmid DNA 
(pBR322, 0.6 pg) is incubated in 40 pi 50 mM Tris-HCI, pH 7.7, containing 5 mM 
MgCI 2 , 5 mM DTT, 50 pg/ml bovine serum albumin, 1.5 mM ATP, 5 mM spermidine 
and 20 mM KCI, for 30 min at 37°C. Changes in DNA supercoiling status can be 
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readily observed by the alteration of mobility of the DNA in agarose gels stained with 
ethidium bromide and comparison to the mobility of relaxed and supercoiled plasmid 
template 

The ATPase activity of topoisomerases is measured using a coupled 
5 spectrophotometric ATPase assay described for the GyrB subunit of E. coli. ATPase 
activity is assayed in 300 pi of 40 mM Tris-HCI (pH 8.0), containing 25 mM KCI, 2.5 
mM spermidine, 4 mM MgCI2, 400 pM phosphoenolpyruvate, 250 pM NADH, 3 pi of 
pyruvate kinase /lactate dehydrogenase mix and ATP (0.5 — 3.5 mM). The reaction 
is started by the addition of truncated N-terminal derivatives of the GyrB protein (5 
10 pM) containing the ATPase domain. ATPase activity is reflected as a decrease in 
absorbance of light at 340 nanometer wavelength. 

DNA cleavage assay. 

Quinolone drugs interfere with the DNA strand breakage-ligation cycle activity 

15 of many topoisomerases. Incubation of topoisomerase and linear or supercoiled 

pBR322 plasmid DNA, or small linear DNA fragments, in the presence of quinolones 
and magnesium results in the trapping of a complex of topoisomerase, DNA with a 
double stranded break and the drug. The topoisomerase remains bound to the 
cleaved DNA, however treatment with a denaturant such as SDS or proteinases 

20 remove/degrade the gyrase, releasing the cut DNA. Certain consensus sequences 
representing preferred cut sites of E. coli gyrase in plasmid pBR322 have been 
identified in template DNA molecules used in these assays. This assay is useful for 
mode of action studies of inhibitors of gyrase/topoisomerase IV activity and in 
particular of the strand breakage-ligation function. Cleavage reactions are performed 

25 with linear or supercoiled DNA. A typical cleavage reaction using linear DNA to 
measure cleavage by E. coli and S. aureus gyrase and topoisomerase IV in the 
presence of drugs is as follows: gyrase/ topoisomerase IV is incubated in 20 pi 25 
mM Tris-HCI (pH 7.5) containing 0.5 mM EDTA, 0.5 mM DTT, 3 pg bovine serum 
albumin per ml, 10 mM MgCI 2 , 120mM KCL 10 mM ATP, 10 000 dpm of 3' end 

30 labeled linear pBR322 plasmid DNA and drug for 1 hour at 37°C. (Note: for S. 

aureus, KCI is replaced with 0.7 M potassium glutamate). Reactions are terminated 
by adding 5 pi 2.5% SDS-2.5 mg proteinase K per ml and incubating at 37°C for 30 
minute, then adding 5 pi 30% glycerol-! % SDS-50 mM EDTA-0.05 % bromophenol 
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blue. Cleavage products are resolved on 1 % agarose gels and visualized by 
autoradiography. 

Additional cleavage assays are also used that measure 1) the linearization of 
supercoiled plasmid DNA (pBR322), with linearization measured using scanning 

5 densitometry of DNA species separated on 1 % agarose gels, or 2) the cleavage of 
small linear DNA molecules of approximately 100 bp encompassing the preferred 
cleavage sequence 5 - GGCTGGATGGCCTTCCCCAT - 3' from position 990 in 
plasmid pBR322. In the latter case, the fragment is produced by PCR and 
radiolabeled with y-^P ATP at the 5* end of the top strand. This DNA is incubated 

10 with 1 .3 pmol DNA gyrase in a total volume of 10 yl 35 mM Tris-HCl (pH 8.0), 24 mM 
KCI, 2 mM spermidine, 4 mM MgCI2 and inhibitor compound at 37°C for 10 min. 
Reactions are stopped by addition of 8 mM EDTA and 1% SDS, then treated with 
500 pg/ml proteinase K for 2 hours at 37°C. The DNA is then cleaned by phenol- 
chloroform extraction and ethanol precipitation, resuspended in TE buffer (pH 8.0), 

15 and loaded and resolved on 12 % sequencing gels containing 7M urea. In the 

presence of inhibitors of the strand breakage-ligation function, radioactive cleavage 
products are detectable by autoradiography. Modifications of this assay whereby 
one strand of the DNA substrate is labeled with an affinity tag such as biotin and the 
other is radiolabeled or fluorescently labeled should facilitate rapid separation and 

20 detection of cleavage products using streptavidin coated columns or plates, resulting 
in higher assay throughput. 

Gyrase activity assays: DNA replication: 

Early work by Fuller and Kornberg revealed that a partially purified crude 

25 soluble fraction derived from Escherichia coli cells (designated fraction II) contained 
the components necessary for replication of plasmids containing oriC (E. coli 
chromosomal origin of replication). Replication mediated by this fraction specifically 
required supercoiled plasmids. Although the exact makeup of the protein complex 
mediating the replication was not known, the replication reaction was inhibited by 1) 

30 rifampicin, and 2) nalidixic acid and novobiocin, indicating essential roles for both 
RNA polymerase and DNA gyrase, respectively. Subsequently the reaction was 
reproduced using replication machinery reconstituted from purified protein HU, DnaA, 
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DnaC, DnaB, single stranded binding protein (SSB), primase, DNA polymerase 
holoenzyme, RNA polymerase holoenzyme and GyrA/GyrB. 

The requirement for gyrase activity for replication is exploited for the 
identification of gyrase inhibitors using a replication-based high throughput screen. 

5 Gyrase specific inhibitors are identified from the overall pool of replication inhibitors 
using the secondary assays detailed below. Screening for inhibitors of gyrase in a 
setting where gyrase is participating in an overall reaction that is essential in bacteria 
might better select physiologically relevant inhibitors 

. An assay suitable for high throughput screening of inhibitors of replication 

10 (including gyrase and DnaA inhibitors) is based on the replication reaction of Kaguna 
and Kornberg. This reaction was set up as follows; standard reaction in 25 pi: 40 mM 
Hepes (pH 7.6), 2 mM ATP, 0.5 mM GTP, CTP and UTP, 50 pg/ml bovine serum 
albumin, 6 mM phospho creatine, 100pM dATP, dGTP, dCTP and dTTP, y- 33 ? dTTP 
(50-150 cpm/pmol total nucleotides) 11mM magnesium acetate,100 pg/mL creatine 

15 kinase,85 ng SSB, 48 ng DnaB, 40 ng DnaC, 20 ng primase, 160 ng DNA 

polymerase III holoenzyme, 800 ng RNA polymerase, 150 ng GyrA, 350 ng GyrB, 
120 ng DnaA, 2.5 units topoisomerase 1, 190. ng HU, 0.15 ng Rnase H 200 ng 
supercoiled plasmid template. The reaction is assembled at 0 °C and initiated by 
incubation at 30°C. Replication reactions are terminated by the addition of EDTA to 

20 20 mM. Incorporation of nucleotides into DNA is measured by filtration through 96 
well DEAE filter plates and counting retained radioactivity. 

Compounds inhibiting gyrase activity in Alloiococcus otitidis are found as part 
of a larger program directed at replication. This reaction described above uses the 
replication machinery of a gram-negative organism, which differs somewhat from the 

25 replication machinery of gram positives such as Staphylococcus aureus with respect 
to the specific protein subunits involved. Therefore a similar system specific to 
Alloiococcus otitidis is assembled from the relevant proteins purified from 
Alioiococcus otitidis. Several techniques are then utilized to determine the interaction 
of inhibitors with Gyr A and GyrB. These are described in the literature and include 

30 a) Nuclear magnetic resonance; and b) Capillary electrophoresis. 
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Example 23 

ALLOIOCOCCUS OTfTIDIS ENCODED CELL WALL BIOSYNTHETIC ENZYMES MURA 

Bacterial cell wall peptidoglycan (murein) is a large macromolecule of periodic 

5 structure whose basic unit, a disaccharide-peptapeptide, is polymerized linearly via 
the disaccharide motif and cross-linked laterally via the peptide motif. The process of 
bacteria cell wall biosynthesis starts from the transferase MurA, which transfers the 
addition of an enolpyruvyl moiety to the 3'-hydroxy!-UDP-N-acetyl glycosamine 
(UDP-GluNAc). Subsequently, the reductase MurB reduces the enoi ether to the 

10 lactyl ether, utilize one equiv. of NADPH and a solvent proton to form UDP-A/-acetyl 
muramic acid (UDP-MurNAc). Next a series of ATP dependent amino acid ligases 
(MurC, MurD, MurE and MurF) catalyze the stepwise synthesis of the pentapeptide 
side chain using the newly synthesized carboxylate as the first acceptor site. Each 
enzyme is responsible for the addition of one more residue except MurF, catalyzes 

15 D-ala-D-ala. MurE in gram negative bacteria catalyzes the meso-2, 6- 

diaminopimelate (DAP), while in gram positive bacteria MurE catalyzes L-lysine. 

The product of MurF, UDP-NAM pendapeptide is the final product of the 
cytoplasm enzymes and is the most important precusor for further peptidoglycan 
biosynthesis. UDP-MurNAc pendapeptide is then and catalyzed at the plasma 

20 membrane by the membrane bound enzymes such as the translocase MraY and 
transferase MurG. 

UDP-A^acetyiglucosamine enolpyruvyl transferase (MurA) catalyzes the first 
committed step in bacterial cell wall biosynthesis. The enzyme transfers an 
enolpyruvyl group from phosphoenolpyruvate (PEP) to UDP-A/-acetylglucosamine 

25 (UDP-GluNAc) to the 3-OH of UDP-GlcNAc by an addition-elimination mechanism 
that proceeds through a tetrahedral ketal intermediate. MurA product enolpyruvate 
UDP-A^acetylgiucosamine (EP-UNAG) is a precursor to UDP- N-acetylmuramate 
(UDP-MurNAc), an essential building block for the bacterial cell wall. MurA is 
conserved across both gram-positive and gram-negative bacterial species: gram- 

30 negative bacteria have one copy of the murA and gram-positive bacteria have two 
copies. Alloiocbccus otitidis murA was identified as described in Example SyTable 4 
and its genomic structure set forth in Seq. ID No. 101 . The amino acid sequence of 
the protein encoded by this gene is set out in Seq. Id No. 102. 
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Alloiococcus otitidis murA as a target for anti-infective development 

MurA in E. coli and Streptococcus pneumoniae has been shown to be 
essentia! by gene deletion technique. The essentiality of MurA in gram-positive 
bacteria such as Streptococcus pneumoniae was demonstrated in that its deletion is 
5 fetal. No mammalian homolog to MurA has been reported. MurA is specifically 

inhibited by the natural product antibiotic fosfomycin. Thus the importance of MurA in 
peptidoglycan biosynthesis makes it an attractive target for the design of novel 
antibacterial agent. 

10 Assays for measuring MurA function 

Phosphate detection: 

MurA activity is detected by quantitating the UDP-GluNAc-dependent Pi from 
PEP and assayed by Lanzetta's malachite Green-ammonium molybdate assay. Pi is 
15 quantitated by measuring the optical density at A660 nm. 

Coupled assay with MurB: 

A coupled assay in access of MurB, which reduces the MurA product EP- 
UNAG G to UDP-MurNAc, couples the MurA transferase activity with NADPH 
20 oxidation. The oxidation of NADPH is monitored at 340 nm and is stoichometric with 
the production of EP-UNAG. 

Fluorescence experiments 

Fluorescence experiments to detect murA are performed using the 
25 hydrophobic fluorescence probe 8-anilino-1 -naphthalene sulfonate (ANS). The 
fluorescence quenching of MurA/ANS solutions upon addition of UDP-GlcNAc or 
pyruvate-P is concentration dependent and in a saturating manner. 

Isothermal titration calorimetry 

30 The binding of UDP-GluNAc to MurA is studied in the absence and presence 

of the antibiotic fosfomycin by isothermal titration calorimetry. Fosfomycin binds 
covalently to MurA in the presence of UDP-GluNAc and also in its absence as 
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demonstrated by MALDI mass spectrometry. Novel Fosfomycin analogs and other 
antibiotics that bind to murA are also identifiable using isothermal titration chemistry. 

Capillary electrophoresis-based enzyme assay 

5 A capillary electrophoresis-based enzyme assay for MurA is described by Dai 

and colleagues . This method, based on UV detection, provides baseline separation 
of one of the reaction products, EP-UNAG, from substrates PEP and UDP-GIcNAc 
within 4 min. The other product, phosphate, is not detectable by UV at 200 nm. 
Quantitation of individual components, substrates or product, is be accomplished 

10 based on the separated peaks. This assay is also used to detect novel antibiotics, 
which inhibit murA activity. 

Example 23 

ALLOIOCOCCUS OTtTIDIS ENCODED CELL WALL BIOSYNTHETIC E NZYMES MUFtB 

15 

MurB, the UDP-^acetyl enolpyruvyl glucosamine reductase, commits the second 
step of bacterial cell wall biosynthesis in cytoplasm and is responsible for the reduction of 
the enol ether to the lactyl ether, utilizes one equiv. of NADPH and a solvent proton. The 
product of MurB is UDP-N-acetylmuramic acid (UDP-MurNAc), the linker of the peptide 

20 and glycan portions of ceil wall precursor UDP muramyl-pentapeptide. MurB from E. coll 
is a 342 amino acid protein, which has a distinctive yellow color characteristic of bound 
flavin as its co-factor. The biochemistry characterization and X-ray crystal structure of 
MurB in E. coli, in Staphylococcus aureus and Streptococcus pneumoniae have been 
studied extensively. The gene Alloiococcus oitidis murB was identified as disclosed as 

25 described in Example 5, and is set out in Seq. ID No. 39. The amino acid sequence of 
the protein encoded by this gene is set out in Seq. ID No. 40. 

Alloiococcus oitidis murB as a target for anti-infective development 

30 The essentiality and unique function of MurB in prokaryotic cells and the 

absence of homologue in eukaryotic cells make it an attractive novel antibacterial 
target. To date, no small molecule inhibitors of MurB have been reported. 
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Alloiococcis oititidis ORF-1263 {murB ) (Seq. ID No. 39) encodes enzyme 
UDP-A^acetylenolpyruvylglucosamine Reductase (MurB) as shown by sequence 
homology. 

5 Assays for measuring MurB activity 

Spectrophotometry assay monitoring NADPH consumption: 

MurB activity is typically monitored by its biochemical reaction in which 
NADPH reduces the bound FAD and resulting decrease in absorbance at 340 nm. 
Enzyme is maximally activated in the presence of K+, NH 4 at cation concentrations 
10 between 1 0-50 mM. 



Coupled assay with MurC: 

In designing an end point assay for high through put screen (UTS), a novel 
coupled assay in access of UDP-MurNAc L-alanine synthase {MurC) was developed 

15 at Wyeth. This assay utilizes the biochemically synthesized MurA product EP-UNAG 
as substrate, coupled with limited MurB and excess MurC in the reaction with all 
other substrates/components involved. In this assay, MurB is responsible for the 
reduction of the enol ether to the iactyl ether, and the follow up enzyme MurC 
catalyzes the ATP dependent ligation of the first of the five amino acids of UDP- 

20 peptapeptide with a release of one molecule of phosphate. After 60 minutes of 
incubation, color reagent malachite green was added and phosphate was detected 
spectrophotometrically. 

Fluorescence binding assay 

25 A fluorescence method developed at Wyeth is used to determine the binding 

potency (Kd value), stoichiometry and nature of binding site of substrates and 
inhibitors interactions with MurB enzymes. This assay is based on changes in 
intrinsic fluorescence of inhibitor and/or enzyme, upon formation of enzyme-inhibitor 
complex. Oxidized form of MurB consists of two fluorescent groups, namely 

30 tryptophan residues and the cofactor FAD. Upon binding inhibitor or substrate, local 
changes in the solvent environment of these groups or overall conformational and 
electronic changes occur in the enzyme due to which the fluorescence emission is 
altered. For instance, inhibitor binding significantly quenched the fluorescence and 
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altered the solvent environment of FAD to a less polar environment. The changes in 
the fluorescence of the FAD moiety are used to estimate binding constants for MurB 
inhibitors. Binding experiments are set up in which a fixed concentration of enzyme is 
titrated with increasing concentrations of the inhibitor. In typical inhibitor binding 
5 experiments, the fluorescence emission of the FAD moiety is quenched due to 
specific interactions of the inhibitor with MuiB enzymes and the binding site was 
saturated at micromolar concentrations of inhibitor. The changes in the fluorescence 
are fitted to mathematical binding models to determine binding affinity. 

10 Temperature-jump isothermal denaturation procedure 

Temperature-jump isothermal denaturation procedure with various methods 
of detection is used to evaluate the quality of putative inhibitors of MurB discovered 
by high-throughput screening. Three optical methods of detection-ultraviolet 
hyperchromicity of absorbance, fluorescence of bound dyes, and circular dichroism- 
15 as well as differential scanning calorimetry are used to dissect the effects of two 
chemical compounds and a natural substrate on the enzyme. The kinetics of the 
denaturation process and binding of the compounds detected by quenching of flavin 
fluorescence are used to quantitate the dose dependencies of the ligand effects. 

20 NMR studies 

NMR studies are performed using perdeuterated, uniformly 1 3C/1 5N-labeled 
samples of MurB. In the case of substrate-free MurB t one or more backbone atoms 
are assigned for 334 residues (96%). For NADP+-complexed MurB, one or more 
backbone atoms are assigned for 313 residues. The strategies used for obtaining 
25 resonance assignments are known. Localizing the NADP+ binding site on the MurB 
enzyme is also studied by NMR methodology. 

Example 25 

ALLOIOCOCCUS Q7777D/S ENCODED CELL WALL BIOSYNTHETIC ENZYME, MUFtC 

30 

Uridine diphosphate-N-acetylmuramate:L-alanine ligase (MurC) catalyzes the. 
third chemical step of bacterial cell wall biosynthesis. This enzyme is a honribosomal 
peptide ligase which utilize ATP to form an amide bond between L-alanine and UDP- 
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N-acetylmuramic acid (UDP-MurNAc). This ATP-dependent ligation adds the first of 
five amino acids to the sugar moiety of the peptidoglycan precursor. Also, in this 
reaction, ATP is converted to ADP with release of one molecule of inorganic 
phosphate. Thus MurC reaction is an essential step in cell wall biosynthesis for both 
5 gram-positive and gram-negative bacteria. The genetic, biochemistry analysis and 
crystal graphic studies of MurC in gram-negative bacteria E. coli have been 
extensively studied. Characterizations of MurC in other pathogens such as 
Staphylococcus aureus and Pseudomonas aeruginosa have also been documented. 

10 Ailoiococcis otitidis encoded MurC as a target for anti-infective development 

The Ailoiococcis otitidis ORF-2602 (murC, Seq. ID No. 95) encodes enzyme 
UDP-MurNAc:L-alanine ligase {MutC) as determined by sequence homology. This 
enzyme presents a target for the development of novel anti-infectives to treat the 
15 disease(s) caused by this pathogen. Novel compounds identified using combinatorial 
chemistries are assayed for their inhibitory effect on MuiC activity using one of the 
asssays set out below. 

Assays for measuring MurC activity 
20 Spectrophotometric assay detecting phosphate release: 

MurC activity is detected by the inorganic phosphate production. Typically 
the reaction mixture contains substrates ATP, L-aianine, UDP-MurNAc, DTT, MgCI 2 
and MurC enzyme. After 20 minutes incubation, the reaction is quenched with the 
addition of malachite Green-ammonium molybdate for a colored reaction. 
25 Absorbance at 660 nm is read 5 minutes after the quench. Absorbance values are 
converted to concentration of Pi with standard curves using KH 2 P0 4j which is 
prepared under identical conditions without the enzyme MurC. 

Spectrophotometric assay detecting formation of ADP 

30 Due to the conversion of ATP to ADP in MurC reaction, the production of 

ADP is monitored in coupled enzymes spectrophotometrically. In this reaction, in 
addition to MurC substrate UDP-MurNAc, L-alanine and ATP, NADH, 
phosphoenolpyruvate, MgCI 2 and (NH 4 ) 2 S0 4 , two other coupled enzymes pyruvate 
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kinase and lactase dehydrogenase are also presented. Reaction mixtures without 
ATP and MurC are incubated at 37°C for 10 min before ATP is added for another 
minute. Reaction is then started by the addition of MurC. The decrease of NADH 
absorbance at 340 nm is monitored spectrophotometrically. One unit of activity 
5 corresponds to 1 umol of ADP formed per hour. 

L-Alanine radio-labeled assay: 

The MurC enzyme activity in this assay is measured as endpoint using 14 C-L- 
alanine and ATP incubated with MgCI 2 , and (NH 4 )2S0 4 in 100 mM Tris/HCI, pH 8.0. 
10 Reaction is initiated by the addition of the catalytic amounts of MurC. Samples of the 
reaction mixture are then mixed with glacial acetic acid and then stored at 4°C. 
Remaining 14 C -L-alanine is separated from 14 C -UDPMurNAc on SCX columns run 
under vacuum. Quenched reaction samples are supplemented with equilibration 
buffer and counted using a liquid scintillation counter. 

15 

Example 26 

Alloiococcub otitidis encoded cell wall biqsynthetic enzymes MurD 

Bacterial UDP-N-acety!muramyl-L-alanine:D-glutamate ligase (MurD), a 
20 cytoplasmic peptidoglycan biosynthetic enzyme, catalyzes the fourth step of bacterial 
cell wall biosynthesis. In this reaction, MurD catalyzes ATP-dependent addition of D- 
glutamate to an alanyl residue of the UDP-N-acetylmuramyl-L-alanine (UDP- 
MurNAc-L-Ala) precursor, generating the UDP-MurNAc-dipeptide. The formation of a 
peptide linkage between the amino function of D-glutamate and the carboxy 
25 terminius of UDP-N-acetylmuramuamyl-L-alanine is generated through this reaction. 
The stoichiometric consumption of ATP supplies the energy needed for this peptide 
bond formation with concomitant generation of ADP and orthophosphate. The murD 
genes were cloned and characterized from gram-positive bacteria of Staphylococcus 
aureus and Streptococcus pyogenes, and gram-negative bacteria from Escherichia 
30 colij Haemophilus influenzae, Bacillus subtilis. Structures of MurD from E. coli and 
MurD complexed with its substrate UDP-MurNAc-L-Ala have been solved to 2.0 A 
resolution. The role of specific amino acids at the active site of MurD have been 
extensively studied using the ortholog and paralog amino acid invariants. Homologue 
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of this gene identified in Alloiococcus otitidis is described in Example 5/Table 4 (Seq. 
ID No 89). The protein encoded by the gene is set forth in Seq. ID. No. 90.. 

Alloiococcus otitidis encoded MurD as a target for anti-infective development 

5 

Due to its high specificity and essentiality, MurD is an attractive target for the 
development of novel antimicrobial agents. Alloiococcis otitidis ORF-2494, by 
sequence homology, has been shown to encode enzyme UDP-N-acetylmuramyl-L- 
alanine:D-glutamate ligase (MurD) (Seq. ID. No. 89). Inhibition of MurD activity is 
10 used to identify novel antimicrobial agents. 

Assays for measuring MurD activity 

Spectrophotometric assay detecting phosphate release: 

15 MurD activity in the presence or absence of a putative inhibitory molecule of 

MurD is detected by the orthophosphate production in test tube or in 96-well format. 
Typically the reaction mixture contains substrates ATP, D-glutamine, UDP-MurNAc- 
L-Ala, DTT, MgCI2 and MurD enzyme. After 20 minutes incubation, the reaction is 
quenched with the addition of malachite Green-ammonium molybdate for a colored 

20 reaction. Absorbance at 660 nm is read 5 minutes after the quench using Molecular 
Devices SpectraMax 250 plate reader. Absorbance values are converted to 
concentration of Pi using orthophosphate standards, which are prepared under 
identical conditions without the enzyme MurD. 

25 

Spectrophotometric assay for detecting formation of ADP in the presence or 
absence of a putative inhibitory mollecule of MurD: 

Due to the conversion of ATP to ADP in MurD reaction, the production of 
ADP is monitored with coupled enzymes of pyruvate kinase and lactase 
30 dehydrogenase spectrophotometrically. In this reaction, in addition to MurD 
substrate UDP-MurNAc-L-ala and ATP, MgCI 2 and (NH 4 )2S0 4 , there is also in 
significant access of NADH, phosphoenolpyruvate, and two coupled enzymes 
pyruvate kinase and lactase dehydrogenase. This protocol monitors ADP formation 
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in the MurD catalyzed reaction, in the presence or absence of a putative inhibitory 
mollecule of MurD, by the decrease of NADH absorbance at 340 nm. 

L-Glutamate radio-labeled assay: 
5 The MurD enzyme activity in the presence or absence of putative inhibitors of 

MurD is also measurable using D- 14 C- glutamate as an endpoint assay. The reaction 
mixture contains D- 14 C- glutamate UDP-MurNAc-L-Ala, ATP, MgCI 2 , (NH 4 ) 2 S0 4 in 
100 mM Tris/HCI, pH 8.0. An HPLC assay with online UV and flow scintillation 
detects the formation of UDP-MurNAc-L-Ala-D- u C Glu and ADP in each reaction. 

10 

Example 27 

ALLOIOCOCCUS OTITIDIS ENCODED CELL WALL BIOSYNTHETIC ENZYME, MURE 

The fifth step in the cytoplasmic peptidoglycan biosynthetic is catalyzed by 

15 MurE. In this step, the monomer units in the Escherichia coli and Staphylococcus 
aureus cell wall peptidoglycans differ in the nature of the third amino acid in the L- 
alanyl-gamma-D-glutamyl-X-D-alanyl-D-alanine side chain, where X is meso- 
diaminopimelic acid or L-lysine, respectively. Therefore, MurE from E. colils the 
UDP-N-acetylmuramoyl-L-alanyl-D-glutamate: meso-diaminopimelic acid ligase, and 

20 MurE from S. aureus is the UDP-N-acetylmuramoyl-L-alanyl-D-glutamate: L-lysine 
ligase. Thus represents the major difference of MurE from other murein enzymes in 
cytoplasm. The amino acid residues catalyzed by MurE plays a key role in the 
integrity of saccuius since it is directly involved in the peptide cross-linkage. MurE 
reaction is also ATP-dependent, which supplies the energy needed for the peptide 

25 bond formation with concomitant generation of ADP and orthophosphate. 

The essentiality of MurE has been well documented in E. coli, in S. aureus, as 
well as other pathogens such as Haemophilis influenzae, Vibrio cholerae and 
Corynebacterium glutamicum. Gene murE has been shown to be essential in 
bacteria. Homoiogue of this gene identified in Alloiococcus otitidis is described in 

30 Example 5/Table 4 (Seq. ID No 25). The protein encoded by the gene is set forth in 
Seq. ID No. 26. 

Alloiococcus otitidis MurE as a target for anti-infective development 
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AJIoiococcis otitidis ORF-851 , by sequence homology encodes enzyme UDP- 
N-acetylmuramyl-L-aianine-D-glutamate ligase: meso-diaminopimelic acid/or L- 
Lysine (MurE) (Seq. ID No 25). MurE activity in the presence or absence of a 
5 putative inhibitory molecule of MurE activity is used to identify novel antimicrobial I 
agents, which may be used ti treat disease caused by Alloiococcis otitidis. 

Assays for measuring MurE activity 

Radio labeled substrate assay: meso-A2pm-adding activity 

10 Activity of MurE from Alloiococcis otitidis in the presence or absence of a 

putative inhibitory molecule of MurE activity is measured by using radio-labeled 
meso- 14 C A2pm mixing with ATP, MgCI 2 , UDP-MurNAc-L-Aia-D-Glu, DTT in 100 mM 
Tris/HCI and MurE from Alloiococcis otitidis . 

15 Radio labeled substrate assay: L-lysine adding activity 

Activity of MurE from Alloiococcis otitidis in the presence or absence of a 
putative inhibitory molecule of MurE activity is measured by using radio-labeled UDP- 
MurNAc-L-Ala-D-14C-Glu mixing with ATP, MgCla. DTT, L-lysine in 100 mM Tris/HCI 
and MurE from Alloiococcis otitidis. 

20 In both cases, mixtures are incubated at 37°C for 30 min, and reactions 

stopped by the addition of acetic acid. Reaction product is separated by high votage 
electrophoresis in 2% formic acid for 45 min. The radio active spots corresponding to 
substrate and reaction product are detected by overnight autoradiography, or with 
radio scanner. The spots are also cut out and counted using liquid scintillation 

25 . counter. 

Example 28 

ALLOIOCOCCUS OTITIDIS ENCODED CELL WALL BIOSVNTHETIC EN ZYME, MURF 

The D-alanyl-D-alanine-adding enzyme MurF encoded by the murF gene 
30 catalyzes is the last step of the cytoplasmic peptidoglycan biosynthesis. MurF 

performs the ATP-dependent formation of UDP-N-acetylmuramyl-L-gamma-D-Glu- 
meso-diaminopimelyl-D-Ala-D-Ala (UDP-MurNAc-pentapeptide). The product of 
MurF, UDP-MurNAc pendapeptide, is the final product of the cytoplasm enzymes and 
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is the most important precusor for further peptidoglycan biosynthesis. UDP-MurNAc 
pendapeptide is then catalyzed by the plasma membrane bound enzymes such as 
the translocase MraY and transferase MurG. Homologue of this gene identified in 
Alloiococcus otitidis is described in Example 5/Table 4 (Seq. ID No 3). The protein 
5 encoded by the gene is set forth in Seq. ID No. 4. 

Alloiococcus otitidis MurF as a target for anti-infective development 

Due to its high specificity, essentiality, and importance of its product UDP- 
MurNAc pentapeptide, MurF is attractive as an antibacterial target. The Alloiococcis 
10 otitidis ORF-48, by sequence homology ,encodes enzyme UDP-N-acetylmuramyl-L- 
alanine-D-glutamate ligase: meso-diaminopimelic acicl/or L-Lysine -alanyl-D-alanine- 
adding enzyme (MurF) (Seq. ID No. 3). MurF activity in the presence or absence of a 
putative inhibitory molecule of MurF activity is used to identify novel antimicrobial 
agents, which may be used to treat disease caused by Alloiococcis otitidis. 



15 



Assays for measuring MurF activity 



Spectrophotometric assay detecting phosphate release: 

Activity of MurF from Alloiococcis otitidis in the presence or absence of a 

20 putative inhibitory molecule of MurF activity is detected by the inorganic phosphate 
release in the ATP dependent MurF reaction. This assay detects nonomole amount 
of Pi in the reaction mixture contains substrates ATP, D-ala-D-ala, UDP-MurNAc- 
tripeptide, DTT, MgCI 2 and MurF enzyme. After 5 minutes incubation, the reaction is 
quenched with the addition of malachite Green-ammonium molybdate for a colored 

25 reaction. 

Coupled spectrophotometric assay detecting formation of ADP 

Due to the conversion of ATP to ADP in MurF reaction, the production of ADP 
in the presence or absence of a putative inhibitory molecule of MurF activity, is 
30 monitored with coupled enzymes of pyruvate kinase and lactase dehydrogenase 
spectrophotometrically. In this reaction, the decrease at 340 nm is observed as 
NADP is consumed in MurF reaction process. The reaction typically contains tris 
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buffer, substrates ATP, D-ala-D-ala, UDP-MurNAc-tripeptide, DTT, MgCl 2 , 
phosphoenopyruvate, NADPH and MurF enzyme. 

Example 29 

5 ALLOJOCOCCUS OTITIDIS ENCODED CELL WALL BIOSYNTHET1C ENZYME, MURG 

MurG, the last enzyme involved in the intracellular phase of peptidoglycan 
synthesis, is a membrane-associated gly cosy (transferase. MurG catalyzes the 
transfer of /^acetyl glucosamine from UDP to the C4 hydroxyl of a lipid-linked N- 

10 acetyl muramic acid derivative (lipid I) to form lipid II. Lipid II is a linked disaccharide 
that is the minimal subunit of peptidoglycan. Once lipid II is formed, this disaccharide 
is translocated across the bacterial membrane where it is polymerized and cross- 
linked to form the peptidoglycan layers. MurG has been shown to be essential for 
bacterial survival. The inactivation of MurG gene rapidly inhibits peptidoglycan 

15 synthesis in exponential growing cells. As a result, various alterations of cell shape 
are observed, and cell lysis finally occurs. Homologue of this gene identified in 
Alloiococcus otitidis is described in Example 5/Table 4 (Seq. ID No 87). The protein 
encoded by the gene is set forth in Seq. ID No. 88. 

20 Alloiococcus otitidis MurG as a target for anti-infective development 

MurG is shown to be associated with the inner face of cytoplasmic 
membrane, and establishing that the entire peptidoglycan monomer unit assembled 
before being transferred across the membrane. MurG is a key enzyme at the border 

25 line between cytoplasmic and membrane of pepdidoglycan synthesis, thus makes it 
an attractive target for novel antibacterial agent. Further, no mammalian analogues 
of MurG have been identified. Due to its high specificity, essentiality, and importance, 
MurG is attractive as an antibacterial target. 

The Alloiococcis otitidis ORF-24B2 has been shown to encode, by sequence 

30 homology, glycosyltransferase (MurG) (Seq. ID No ). MurG activity in the 

presence or absence of a putative inhibitory molecule of MurG activity is used to 
identify novel antimicrobial agents, which may be used to treat disease caused by 
Alloiococcis otitidis. 
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Assays for measuring MurG function 

Radiolabeled reaction 

Activity of MurG from Alloiococcis otitidis in the presence or absence of a 
putative inhibitory molecule of MurG activity is measured by using 14 C labeled N- 
UDP-GluNAc in the reaction containing UDP-MurNAc-pentapeptide, MgCI 2 , ATP and 
MurG protein. The reaction is stopped after 30 min incubation and by boiling for 3 
min. The reaction mixtures are applied to a Whatman I filter paper and subject to 
descending chromatography overnight. Radioactivity is located and countered with a 
scanner. This assay is also used to identify the specificity of inhibitor of MraY or 
MurG, based on the detection of radiolabeled 14 C GluNAc incorporated into 
membrane precursors. 



Fluorometric assay 

15 Based on the decrease in NADPH fluorescence at 465 nm, MurG reaction is 

also monitored in a reaction mixture of HEPES buffer, MgCI 2 , Triton, 
phosphoenolpyruvate, and coupled enzymes of lactic dehydrogenase and pyruvate 
kinase, UDP-GluNAc and synthesized lipid I analogue in the presence or absence of 
putative inhibitors of MurG activity. One micromolar UDP corresponds to 500- 

20 fluorescence unit under the instrument setting. 

Example 30 

ALLOIOCOCCUS OT1TID1S ENCODED BY HMG CPA REDUCTASE (MVAA) 

25 Two pathways for isopentenyl diphosphate (I PP) synthesis have been 

described in bacteria: the mevalonate pathway and the non-mevalonate (MEP or 
GAP-pyruvate) pathway. The mevalonate pathway predominates in the 
archaebacteria, gram-positive organisms, yeast and mammals; whereas the MEP 
pathway is found in gram-negative organisms, B. subtilis, chlamydia, and 

30 mycobacterium. The first HMG CoA reductase gene to be sequenced was cloned 
from P. mevalonii, in which HMG CoA reductase permits growth on mevalonate as a 
sole carbon source. A number of genes of the mevalonate pathway were identified in 
S. aureus, S, epidermidis, S. pyogenes, S. pneumoniae, E. faecalis and E. faecium. 
One of the genes, which encodes for HMG-CoA reductase (mvaA), when deleted 
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severely attenuated for virulence in a mouse model indicating that mvaA is essential. 
Due to its high specificity, essentiality, and importance, mvaA is attractive as an 
antibacterial target. Homologue of this gene identified in Alloiococcus otitidis is 
described in Example 5/TabIe 4 (Seq. ID No 37). The protein encoded by the gene is 
5 set forth in Seq. ID No. 38. 

HMG-CoA reductase (MvaA) as a target for anti-infective development 

The Alloiococcis otitidis ORF- has been shown to encode, by sequence 
10 homology, HMG-CoA reductase {mvaA) (Seq. ID No 37). MvaA activity in the 
presence or absence of a putative inhibitory molecule of HMG-CoA reductase 
(mvaA) activity is used to identify novel antimicrobial agents, which may be used to 
treat disease caused by Alloiococcus otitidis, 

15 Assays for measuring HMG-CoA reductase (mvaA) activity 

MvaA is purified by standard methods using widely available molecular tags 
following expression at high level from E. coll Enzymatic activity is monitored in the 
presence or absence of a putative inhibitory molecule of HMG-CoA reductase activity 
by following oxidation of NADPH to NADP spectrophotometrically at 340 nm. The 
20 assay is carried out in the following buffer: 0.25 mM NADPH, 0.25 mM HMG-CoA, 50 
mM NaCI, 1 mM EDTA, 5 mM DTT, 25 mM KH 2 P0 4 (pH 7.5). The assay is 
amenable to HTSjn high density screening microtiter plates. 

25 Forward reaction: Activity of HMG-CoA reductase (mvaA) from Alloiococcus 

otitidis in the presence or absence of a putative inhibitory molecule of HMG-CoA 
reductase activity is measured by reductive deacylation of HMG-CoA to mevalonate 
as measured the consumption of NADPH to NADP. Unlike other class II HMG Coa 
reductases, MvaA from Alloiococcus otitidis, like S. aureus, can use either NADPH or 

30 NADH cofactor in the reaction. The following kinetic data describe the reaction: 
K^hmg coa) = 40 pM, K^adph, = 70 pM, KfnfHADp) = 100 pM (12). This assay is 
inhibitable by the statin drug fluvastatin; the Kj was measured at 320 pM, which is 
four orders of magnitude higher than the K f for class I HMG-Coa reductases. 
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Reverse reaction: The oxidative acylation of mevalonate to HMG-CoA in the 
presence or absence of a putative inhibitory molecule of HMG-CoA reductase activity 
is also monitored. The following kinetic data describes the reaction: Kmfmevatonate) = 
670 pM, Km(coASH) = 390 pM, K^nadpj = 580 pM (12). 

5 

Example 31 

ALLOIOCOCCUS OTITIDIS ENCODED DIPHOSPHOMEVALON ATE DECARBOXYLASE (MVAD) 

Diphosphomevalonate decarboxylase, encoded by mvaD, the final enzyme 
10 acting in the mevalonate pathway of IPP synthesis was cloned from S. aureus by 
Wilding et a/ in 2000. Insertional inactivation of mvaD could only be accomplished 
when the strains were supplemented with mevalonate, indicating that mvaD is 
essential. The final step of the mevalonate pathway leading to IPP is the 
decarboxylation and dehydration of mevalonate-5-pyrophosphate to form isopentenyl 
15 diphosphate by MvaD (diphosphomevalonate decarboxylase). 

MvaD homologues are well represented in gram-positive organisms (10). 
Phylogenetic analysis revealed that the cluster of gram-positive enzymes (39-80% 
identity) were well separated from the eukaryotic homologues, suggesting utility as 
an antibacterial target. The Aifoiococcis otitidis ORF- 1275b has been shown to 
20 encode, by sequence homology, diphosphomevalonate decarboxylase (MvaD,) (Seq. 
ID No. 43). MvaD activity in the presence or absence of a putative inhibitory molecule 
of diphosphomevalonate decarboxylase (MvaD,) activity is used to identify novel 
antimicrobial agents, which may be used to treat the disease(s) caused by 
Alloiococcus otitidis: The protein encoded by the gene is set forth in Seq. ID No. 44. 

25 
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Example 32 

ALLOIOCOCCUS OTITiDIS ENCODED HMG CP A SYNTHASE (MVAS) 

The second step of the mevalonate pathway leading to IPP is the irreversible 
5 condensation of acetoacetyl-CoA and acetyi-CoA to form HMG-CoA by MvaS (HMG 
Co A synthase). It has been shown that mvaS knockout mutant of S. pneumoniae 
was attenuated for virulence. Due to its high specificity, essentiality, and importance, 
mvaS is attractive as an antibacterial target. Homologue of this gene identified in 
Alloiococcus otitidis is described in Example 5/Table 4 (Seq. ID No 35). The protein 
10 encoded by the gene is set forth in Seq. ID No. 36. 

HMG COA SYNTHASE (MVAS) AS A TARGET FOR ANTI-INFECTIVE DEVELOPMENT 

The Alloiococcis otitidis ORF- has been shown to encode, by sequence 
15 homology, MvaS (HMG CoA synthase) (Seq. ID No. 35). MvaS activity in the 

presence or absence of a putative inhibitory molecule of HMG-CoA synthase (mvaS) 
activity is used to identify novel antimicrobial agents, which may be used to treat 
disease caused by Alloiococcus otitidis. 



20 Assays for measuring MvaS function 

MvaS is purified by standard methods using widely available molecular tags 
following expression at high level from E. coli. HMG-CoA synthase activity in the 
presence or absence of a putative inhibitory molecule of HMG-CoA synthase {mvaS) 
is assayed by measuring the loss of the enolate form of acetoacetyl-CoA 

25 spectrophotometrically. The reaction is carried out in a buffer containing 50 mM Tris 
(pH 9.75), 5.0 mM MgCI 2l 500 pM acetyl-CoA, 20 pM acetoacetyl-CoA and enzyme. 
The enolate formed is monitored at 302 nm; therefore, as the acetoacetyl-CoA is 
consumed the signal is depleted. Using this assay the following kinetic data is 
measured: K^ty,^) = 350 pM; K>W-*^ = 1 0 pM. This assay is amenable 

30 to HTS in high- high density screening microtiter plates. 
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Example 33 

ALLOIOCOCCUS OTITIDIS ENCODED NICOTINAMIDE ADENINE D INUCLEOTIDE ADENYLYL 

TRANSFERASE (NADD) 

5 Nicotinamide adenine dinucleotide (NAD) is an essential molecule in all living 

cells. NAD is synthesized via a multi-step de novo pathway or via a pyridine salvage 
pathway. The enzyme nicotinic acid mononucleotide adenylyl transferase (NaMN AT, 
EC2.7.7.18) catalyzes the conversion of ATP and nicotinic acid mononucleotide 
(NaMN) to nicotinic acid adenine dinucleotide (NaAD). The nadD gene, encoding 

10 bacterial NaMN AT, is essential for NAD biosynthesis and bacterial cell survival. 
NadD contains well-conserved the nucleotidyl transferase consensus sequence 
(GXFXXXHXGH). The adenylyl transferase encoded by the nadD gene prefers 
NaMN over nicotinomide mononucleotide (NMN) as substrate. Due to its high 
specificity, essentiality, and importance, nadD is attractive as an antibacterial target. 

15 Homologue of this gene identified in Alloiococcus otitidis is described in Example 
5/Table 4 (Seq. ID No 91). The protein encoded by the gene is set forth in Seq. ID 
No. 92. 

NICOTINAMIDE ADENINE DINUCLEOTIDE ADENYLYL TRANSFERASE (NADD) 
20 AS A TARGET FOR ANTI-INFECTIVE DEVELOPMENT 

The Alloiococcis otitidis ORF- has been shown to encode, by sequence 
homology, niotinomide adenine dinucleotide adenyl transferase (NadD) (Seq. ID No. 
91). NadD activity in the presence or absence of a putative inhibitory molecule of 
25 NadD activity is used to identify novel antimicrobial agents, which may be used to 
treat disease caused by Alloiococcus otitidis. 

Assays for measuring NadD function 
Discontinuous assay 

30 NadD activity in Alloiococcus otitidis is measured in the presence or 

absence of a putative inhibitory molecule of NadD activity. NadD converts 
nicotinic acid mononucleotide (NaMN) and adenosine triphosphate (ATP) to 
nicotinic acid dinucleotide (NaAD) and pyrophosphate (PPi). Each PPj 
molecule produced by the NadD reaction is then converted to two phosphate 
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(Pi) molecules in the presence of inorganic pyrophosphatase (PPase). The Pi 
molecules present are quantitated with a malachite green reagent at 660 nm. 

HPLC-based assay: Enzyme activity is measured by HPLC quantitation 
of the reaction products. A neutralized aliquots from the reaction described 

5 above was injected into an HPLC system utilizing a 250 x4.6 mm Supelcosil 
LC-1 8 5\im reversed-phase column. The elution conditions: 9 min at 1 00% 
buffer A (0.1 M potassium phosphate buffer, pH6.0,6 min at up to 12% buffer B 
(buffer a, containing 20% methanol, 2.5 min at up to 45% buffer B, 2.5 min at 
up to 1 00% buffer B, and hold at 1 00% buffer B for 5.5 min. The eluate 

10 absorbance was monitored at 254 nm. 

Continuous assay 

In bacteria, NadD combines nicotinic acid mononucleotide (NaMN) and 
adenosine triphosphate (ATP) to form nicotinic acid adenine dinucleotide (NaAD). 

15 NadE then converts NaAD into nicotinamide adenine dinucleotide (NAD) in the 

presence of ammonia and ATP. In the assay, the NAD product is reduced to NADH 

j with alcohol dehydrogenase (ADH) and ethanol, thus permitting direct spectrometric 
detection of NADH at 340 nm wavelength. The coupled reaction above also includes 
inorganic pyrophosphatase (PPase) to prevent accumulation of the pyrophosphate 

20 byproduct from the consumption of ATP. 

Example 34 



25 




NAD is a central compound in cellular metabolism. The final metabolic 
step in the pathway is conversion of nicotinamide adenine dinucleotide - product of 
NadD reaction - to NAD, a step catalyzed by the enzyme NAD synthetase (NadE). 
NaMN - substrate for NadD - can be formed by three different enzymatic reactions: 
30 in the de novo pathway from quinolinate, in Preiss-Handler salvage pathway from 
nicotinic acid, and in the nucleoside salvage pathway by deamindation of 
nicotinamide mononucleotide. In bacteria, there are no known alternatives for the 
metabolic steps between NaMN and NAD. Mutants blocked in these steps cannot be 
recovered as auxotrophs since the required metabolites are not taken up by cells. In 
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the bacterial cells, the second substrate for NadE is ammonium, as opposed to 
glutamine for eukaryotes. NadE is an essential and conserved protein in the 
eubacterial nicotinamide adenine dinucleotide (NAD) biosynthesis pathway. 
Homologue of this gene identified in Alloiococcus otitidis is described in Example 
5/Table 4 (Seq. ID No 49). The protein encoded by the gene is set forth in Seq. II 
No. 50. 



Assays for measuring NadE function: 



10 



15 



The Alloiococcis otitidis ORF- has been shown to encode, by sequence 
homology, niotinomide adenine dinucleotide adenyl synthase (NadE) (Seq. ID No. 
49). NadE activity, in the presence or absence of a putative inhibitory molecule of 
NadE activity is used to identify novel antimicrobial agents, which may be used to 
treat disease caused by Alloiococcus otitidis. 



Discontinuous assay: 

In assay, NadE converts nicotinic acid adenine dinucleotide (NaAD) into 
nicotinamide adenine dinucleotide (NAD) in the presence of ammonia and ATP. 
Each PPi molecule produced by the NadE reaction can then be converted to 
20 two phosphate (Pi) molecules in the presence of inorganic pyrophosphatase 
(PPase). The Pi molecules present can then be quantitated with a malachite 
green reagent at 660 nm. 

HPLC-based assay: 

25 Enzyme activity can be measured by HPLC quantitation of the reaction 

products. A neutralized aliquots from the reaction described above was injected 
into an HPLC system utilizing a 250 x4.6 mm Supelcosil LC-18 5um reversed- 
phase column. The elution conditions: 9 min at 100% buffer A (0.1 M potassium 
phosphate buffer, pH6.0,6 min at up to 12% buffer B (buffer a, containing 20% 

30 methanol, 2.5 min at up to 45% buffer B, 2.5 min at up to 1 00% buffer B. and 
hold at 100% buffer B for 5.5 min. The eluate absorbance was monitored at 254 
nm(1). 
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Continuous assay: 

Coupled NadD-NadE assay. NadD and NadE can be detected in one 
continuous coupled assay. In first reaction, NadD combines nicotinic acid 

5 mononucleotide (NaMN) and adenosine triphosphate (ATP) to form nicotinic acid 
adenine dinucleotide (NaAD). NadE then converts NaAD into nicotinamide adenine 
dinucleotide (NAD) in the presence of ammonia and ATP. In the assay, the NAD 
product is reduced to NADH with alcohol dehydrogenase (ADH) and ethanol, thus 
permitting direct spectrometric detection of NADH at 340 nm wavelength. The 

10 coupled reaction above also includes inorganic pyrophosphatase (PPase) to prevent 
accumulation of the pyrophosphate byproduct from the consumption of ATP (this 
method can be use as HTS format). 

NadE assay. In assay, NadE converts NaAD into nicotinamide adenine 
dinucleotide (NAD) in the presence of ammonia and ATP. The NAD product is 

15 reduced to NADH with alcohol dehydrogenase (ADH) and ethanol, thus permitting 
direct spectrometric detection of NADH at 340 nm wavelength. The reaction above 
also includes inorganic pyrophosphatase (PPase) to prevent accumulation of the 
pyrophosphate byproduct from the consumption of ATP (this method can be use as 
HTS format). 

20 

Example 35 

ALLOIOCOCCUS OTITIDIS ENCODED PUTATIVE MEMBRANE PROTEIN NORA 

An efflux transporter NorA that was originally identified in Staphylococcus 
25 aureus belongs to the family of multidrug resistance (MDR) transporters. NorA is 

encoded by chromosomally-iocated norA gene, it has broad substrate specificity and 
mediates resistance to various lipophilic and monocationic compounds such as 
ethidium bromide (EtBr), cetrimide, benzalkonium chloride, rhodamine 6G, 
tetraphenyiphosphonium (TPP), chloramphenicol as well as some hygrophilic 
30 quinolones such as norfloxacin, ciprofloxacin and oxafioxacin. Increased levels of 

norA expression are associated with single nucleotide changes upstream of norA in a 
putative promoter/operator region and lead to increased pleiotropic resistance. NorA 
is a putative membrane protein with 12 predicted membrane-spanning domains and 
is classified as a member of major facilitator superfamily (MFS), a subgroup of MDR 
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transporters characterized by the presence of 12-14 transmembrane segments and 
the use of proton motive force as an energy source for drug efflux. NorA homologs 
that belong to MFS family include Bmr and Bit of Bacillus subtiiis, EmeA of 
Enterococcus faecalis and PmrA of Streptococcus pneumonia. The expression of 
5 bmr gene in B. subtilis is upregulated by the product of adjacent bmR gene in the 
presence of inducers (rhodamine 6G and TPP), and there is an evidence that 
expression of norA in S. aureus is regulated by AlrS-AIrR two-component regulatory 
system. 

It remains unknown whether the efflux of various toxins is a primary function 
10 of NorA. When overexpressed in E. coli, norA produces resistance to a broad range 
of substrates including fluoroquinolones. Everted membrane vesicles prepared from 
nor>4-expressing E. coli exhibit energy-dependent transport of norfloxacin, the 
transfer is abolished by cyanide m-chlorophenylhydrazone (CCCP) and nigericin but 
not by valinomycin indicating that NorA-mediated transfer is coupled to the proton 
15 gradient of cell membrane. Norfloxacin uptake in everted vesicles as well as NorA- 
associated resistance phenotype is inhibited by reserpine and verapamil that also 
inhibit other MDR transporters and are toxic to mammalian cells. Histidine-tagged 
NorA (NorA-His) was recently overexpressed and purified from E. coli, reconstituted 
into both everted membrane vesicles and proteoliposomes and was shown to 
20 function as a self-sufficient efflux pump using fluorescent dye Hoechst 33342. Due to 
its high specificity, essentiality, and importance, norA is attractive as an antibacterial 
target. Homologue of this gene identified in Afloiococcus otitidis is described in 
Example 5/Table 4 (Seq. ID No 67). The protein encoded by the gene is set forth in 
Seq. ID No. 68. 

25 

NorA as a target for anti-infective development 

The Afloiococcis otitidis ORF- has been shown to encode, by sequence 
homology, NorA (Seq. ID No. 67). NorA activity in the presence or absence of a 
30 putative inhibitory molecule of NorA activity is used to identify novel antimicrobial 

agents, which may be used to treat disease caused by Alloiococcus otitidis.. Because 
of broad substrate specificity of NorA, NorA inhibitors should be particularly useful 
against pathogens that possess multiple drug resistance. 
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Whole-cell high-throughput screen (HTS) assay that measures NorA activity 
in the presence or absence of a putative inhibitory molecule of Alloiococcis otitidis 
NorA activity is used to identify potential inhibitors of NorA activity. The assay utilizes 
a subtilis strain (aaNA) that has both Bmr and Bit genetically inactivated while 
Alloiococcis otitidis NorA is supplied on the plasmid expression vector. The screen is 
based on the reversing of the resistance of aaNA to EtBr. The exponentially growing 
cells are inoculated into the wells of a 96-well plate to ODeoo^-OOl, the compounds 
are added at 20 pg/ml and EtBr is added at 10 pg/ml. Plates are incubated for 18 hrs 
at 37°C and examined for growth. Compounds that inhibit growth are subsequently 
tested in the presence/absence of EtBr for toxicity and effectivity. The efflux of EtBr 
from ceils is monitored as described previously. The exponentially growing cells are 
loaded with EtBr at a concentration of 10 Dg/ml for 20 min at 37°C in the presence of 
reserpine (20 Dg/ml). Cells are centrifuged, resuspended to an OD6oo=0.2 in a 
minimal medium GM1 alone or in the presence of inhibitor compound. Fluorescence 
of EtBr is monitored on a fluorimeter at an excitation □ of 530 nm and emission □ of 
600 nm.. 

Monitoring of Hoechst 33342 efflux 

The efflux of fluorescent dye Hoechst 33342 from either everted membrane 
vesicles prepared from Ailoiococcus otitidis His-NorA overexpressing E. coli or a 
proteoliposomes reconstituted with Alioiococcus otitidis His-NorA is also used to 
monitor NorA activity in the presence or absence of putative inhibitors of NorA. 
Everted membrane vesicles are diluted into 2 ml of 50 mM potassium HEPES (pH 
7.2), 8.5 mM NaCi, 2 mM magnesium sulfate at a final protein concentration of 40 
pg/mL NorA is activated by the addition of either 0.5 mM lactate or 0.1 mM Mg 2+ - 
ATP. Hoechst 33342 is used in a range of 12.5 to 200 nM. Inhibitors are added at 
various concentrations prior to the addition of Hoechst 33342. Fluorescence change 
is monitored at excitation and emission wavelenghths of 355 and 457 nm 
respectively in a RuoroMax spectrofluorimeter. For proteoliposome assay, the His- 
NorA proteoliposomes are diluted into a cuvette containing 2 ml of 20 mM potassium 
phosphate, 50 mM potassium sulfate, 2 mM magnesium sulfate (pH 7.0) at a protein 
concentration of 10 pg/ml. The inhibitor compounds and Hoechst 33342 are added at 
various concentrations and the fluorescence is measured as described previously. 
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Example 36 
Alloiococcus ormpis encoded Obg gtp&hf 

The obg gene is the second gene in a two-gene operon along with the stage- 
CD sporulation gene spoOB in B. subtilis. SpoOB is central to the phospho-relay 
signal cascade that initiates sporulation. Obg is a member of the GTPase 
superfamily by virtue of homology throughout a small portion of the protein that in 
other members of the family Is responsible for nucleotide (GTP/GDP) binding. Obg 
is essential for growth. Initiation of sporulation is thought to be triggered by changes 
in the GTP content of the cell; therefore, the presence of a GTP binding protein in an 
operon with a central player in the process is suggestive of a role for Obg in sensing 
GTP levels and transmitting a signal to SpoOB. 

It has been shown that Obg is involved in activation of the a 3 transcription 
factor in S. subtilis in response to environmental stress. Cells were depleted of Obg 
utilizing a construct that put obg under the control of an inducible (P^) promoter. 
Depletion of IPTG resulted in bacteria that failed to activate a 8 . These studies further 
showed by yeast-two-hybrid analysis that Obg interacted with several known a 3 
regulators, the so-called Rsb proteins. 

The role Obg plays in transmitting signals important for sporulation and 
activation of the stress sigma factor may be indicative of the activities that small GTP 
binding proteins carry out in triggering cell division in response to GTP levels. Due to 
its high specificity, essentiality, and importance, obg is attractive as an antibacterial 
target. Homologue of this gene identified in Alloiococcus otitidis is described in 
Example 5/Table 4 (Seq. ID No 71). The protein encoded by the gene is set forth in 
Seq. ID No. 72. 

Obg as a target for anti-infective development 

Obg is essential for bacterial viability. Conditional lethal alleles revealed that 
Obg is required for early events in sporulation and is involved in transmitting signals 
require for activation of the stress sigma factor. The Alloiococcis otitidis ORF- has 
been shown to encode, by sequence homology, obg (Seq. ID No.71). Obg activity in 
the presence or absence of a putative inhibitory molecule of Obg activity is used to 
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identify novel antimicrobial agents, which may be used to treat disease caused by 
Alloiococcus otitidis,. 

Nucleotide binding 

5 Obg binding to nucleotide in the presence or absence of putative 

antimicrobials, which inhibit Obg activity, is monitored by a simple filter-binding 
assay. Alloiococcus otitidis Obg (1-5 ug) is incubated with c^P-GTP (0.2 uCi) in a 
buffer consisting of 50 mM Tris (pH 8.5), 1 .5 mM MgCI 2 , 0.1 mM EDTA, 200 mM KCI, 
10% glycerol for 30 minutes to 3 hours at 37'C. A portion of the reaction mix is 

10 spotted on nitrocellulose membrane, washed (50 mM Tris (pH 8.5), 1 .5 mM MgCI 2 , 1 
mM DTT) and dried. The membrane is then exposed to X-ray film. Alternatively, the 
spots are excised and counted. This assay is directly amenable to HTS using filter 
plates. 

15 GTPase activity 

The GTP hydrolytic activity of Obg is monitored using thin-layer 
chromatography (1 , 2, 10). Obg and cr^P-GTP are incubated in 50 mM Tris (pH 8.5), 
1 .55 mM MgCI 2 , 0.1 mM EDTA, 200 mM KCI, 10% glycerol for 30 minutes at 37°C. 
An aliquot of the reaction is placed on PEI cellulose and the strip developed with 0.5 

20 M KH 2 P0 4 , 1 .0 M NaCI (pH 3.7). The spots conforming to GDP and GTP are 
identified by UV shadowing, excised and counted. . 

Alternatively, the hydrolysis of v^P-GTP is monitored by assaying for 
liberated P, (12). Obg and cPP-GTP are incubated in 50 mM Tris (pH 8.5), 1.5 mM 
MgCI 2 , 0.1 mM EDTA, 100 mM KCI, 10% glycerol for 30 minutes to 3 hours at 37°C. 

25 The reaction is stopped by the addition of a slurry of charcoal in 1 mM Kpi (pH 7.5), 
which selectively binds the GTP and GDP. The liberated P, in the supernatant is 
monitored by Cerenkov counting. Free P, is also monitored with the Malachite Green 
reagent. 

30 Autophosphorylati<> n 

Obg autophosphorylation is monitored by incubating Obg with y^P-GTP in 50 
mM Tris (pH 8.5). 1 .5 mM MgCI 2 , 0.1 mM EDTA, 100 mM KCI, 10% glycerol for 30 
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minutes at 37°C. Samples are analyzed following separation on SDS polyacryiamide 
gels, drying the gel and exposure to film. 

Example 37 

5 rpoa, rpob, rpoc. and rpod, the genes encoding the subun1ts comprising 
alloiococcus q7777p/srna polymerase: alpha, beta, beta', and sigma, 

RNA polymerase is an enzyme comprised of multiple highly conserved 
subunits which catalyzes the DNA template directed polymerization of ribonucleic 

10 nucleotides into ribonucleic acid. It is composed of a core enzyme, 02,0, along 
with a fifth subunit present in stoichiometric amounts, □□□which can catalyze RNA 
synthesis non-specifically. Holoenzyme is formed by the introduction of the subunit 
□ which enhances gene promoter recognition and allows specificity. Homologs of 
the genes identified in Alloiococcus otitidis are described in Example 5/Table 4 (Seq. 

15 ID Nos 7, 9, 1 1 , and 1 3). The amino acid sequence of the protein encoded by these 
genes are set forth in Seq. ID Nos. 8, 10, 12 and 14. 

Functions for the individual subunits have been defined biochemically, and 
interactions between them have now been deduced structurally by crystallographic 
analysis of the enzyme from Thermatoga thermophiia, and to a lesser extent, 

20 Escherichia cofi. The alpha subunit, encoded by rpoA, is required for enzyme 

assembly. It also interacts with transcription factors and with DNA elements involved 
in enhanced promoter strength. Beta, encoded by rpoB, is involved in initiation and 
elongation of the polymerization product. Beta 7 (encoded by rpoC), is responsible for 
binding of the enzyme to the DNA template. Omega is required to restore denatured 

25 RNA polymerase to function in vitro. Finally, sigma, encoded by rpoD, directs the 
enzyme to promoters on the template to enhance specificity of transcription 
(polymerization). 
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Alloiococcus otitidis RNA Polymerase: alpha, beta, beta', and sigma as a 

TARGET FOR ANTI -INFECTIVE DEVELOPMENT 

Bacterial RNA polymerase is a validated target for antimicrobial 
chemotherapy in that several inhibitors have been identified and at least one, 
5 rifampin, is in use clinically. Alloiococcus otitidis RNA polymerase holoenzyme is 
essential for bacterial viability. The Alloiococcis otitidis OHFs- have been shown to 
encode, by sequence homology, RNA polymerase holoenzyme (Seq. ID Nos. 7, 9, 
1 1 and 1 3). Alloiococcus otitidis RNA Polymerase activity in the presence or absence 
of a putative inhibitory molecule of Alloiococcus otitidis RNA Polymerase activity is 
10 used to identify novel antimicrobial agents, which may be used to treat disease 
caused by Alloiococcus otitidis. 

Assays for the activity of RNA polymerase 

Genes encoding the subunits of Alloiococcus otitidis RNA polymerase can be 

15 obtained using polymerase chain reaction amplification of the genomic region 

encoding them. The genes are subcloned into a standard expression vector either 
containing an amino acid tag for ease of purification or not. The enzyme are 
overexpressed in Escherichia coli and purified using a standard tag system or 
conventional chromatography . 

20 Because RNA polymerase catalyzes the incorporation of single ribonucleotides 

into RNA, the incorporation of radiolabeled nucleotides into larger oligonucleotides is 
monitored to measure activity of the enzyme in the presence or absence of putative 
inhibitors of RNA polymerase activity. An automated high throughput filtration assay 
has been previously described for E. coli polymerase which uses filterplates 

25 containing a hydrophobic membrane and DEAE beads to capture polymerized RNA. 
G-less supercoiled DNA is used as a template at 6 ug/ml. Reaction contained 0.5 
mM ATP, 0.1 mM UTP, 0.3 mM OTP, approximately 100,000 counts per minute (per 
100 ul) [y-^P] CTP (2000 Ci/mmol, NEN/DuPont), 4 % polyethylene glycol, 4 mM 
DTT, 10 mM MgCI 2 , in 50 mM Tris-acetate (pH 7.8), and 100 mM potassium acetate. 

30 The reaction is carried out at 34 degrees C for 40 minutes, with 10% DMSO present 
in all reactions. The reaction was stopped by adding 100 ul 15% DEAE-Sephacel 
bead slurry in 50% methanol, 20 mM EDTA, and 0.02% NP-40. The reaction was 
incubated for 40-60 minutes at room temperature without shaking, and then 
transferred to a unifilter plate on a filtermate cell harvester. The wells were washed 
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six times with 2X PBS and 0.1% NP-40. After washing the bottom of the plate was 
sealed, and 50 ul scintillation counting liquid was added. Radioactivity was counted 
using a microplate scintillation counter. 

Deconvolution assays are carried out by measuring the inhibition of sigma 

5 activity. Because sigma is required only for promoter specificity, polymerization may 
occur non-specifically if sigma is inhibited. Consequently a second assay is 
described above that is used to deconvolute activity against sigma. 

The binding of putative inhibitory compounds to core enzyme. Several 
techniques are utilized to determine the interaction of inhibitors with individual 

10 subunits and include nuclear magnetic resonance and capillary electrophoresis. 

Example 38 

YPHC. encoding a small GTPase of unknown function from Alloiococcus 

OTITID1S 

15 

The yphC was initially identified in Bacillus subtilis in a collaboration between 
Wyeth and Millennium pharmaceuticals as being essential for growth by insertional 
mutagenesis. Subsequently it was determined that YphC, the encoded protein, 
contained two GTPase domains and had some homology to era. It was further 

20 identified in Thermatoga maritima and Escherichia coli . While no function has yet 
been determined for yphC, it appears that the carboxy terminal may contain an RNA 
binding site. In addition, site directed mutagenesis of four amino acids in the carboxy 
region were found to be lethal (unpublished results, Millennium). Under non- 
permissive conditions, strains carrying temperature sensitive alleles of the gene in E. 

25 coli become elongated, and chromosome segregation becomes abberrant, 

suggesting a role in cell division. Homologue of this gene identified in Alloiococcus 
otitidis is described in Example 5/Table 4 (Seq. ID No 73). The protein encoded by 
the gene is set forth in Seq. ID No. 74. 
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YphC from Alloiococcus otiiidis as a target for antimicrobial chemotherapy 

YphC is an essential protein in Bacillus subtilis and E. coli, and is conserved 
among bacteria including Alloiococcus otitidis. The Alloiococcis otitidis ORF~ has 

5 been shown to encode, by sequence homology, YphC (Seq. ID No. 73). YphC 

activity in the presence or absence of a putative inhibitory molecule of YphC activity 
is used to identify novel antimicrobial agents, which may be used to treat disease 
caused by Alloiococcus otitidis.. Consequently it is proposed here that an assay 
which identified inhibitors of YphC from Alloiococcus would result in small molecules 

10 which can be developed into effect antimcrobial agents. Additionally, because of the 
conservation of the enzyme among bacteria, inhibitors of the protein's function from 
this organism should have broad spectrum activity. 

Assays for the GTP hydrolysis by YphC 

15 The YphC gene from Alloiococcus otitidis is obtained using polymerase chain 

reaction amplification of the genomic region encoding it. The gene is subcloned into 
a standard expression vector either containing an amino acid tag for ease of 
purification or not. The enzyme is then overexpressed in Escherichia coli and 
purified using a standard tag system or conventional chromatography. Activity of 

20 YphC in the presence or absence putative antimicrobial agents is monitored using 
the assay system described below. 

GTP hydrolysis - detection by thin layer chromatography: Reaction is 
carried out in a 50 ul reaction of 50 mM Tris-CI (pH 7.5), 400 mM KCI, 5 mM MgCI2, 
25 1 mM DTT, 10 uM [a-32P] GTP, and 10 ug purified YphC, at 37 degrees for 10 

minutes. The reaction is terminated by transfer of 5 ul samples to 10 ul of ice-cold 20 
mM EDTA. Portions are spotted onto polyethyleneimine-cellulose thin layer 
chromatography plates, which are developed in 0.75 KH2P04 (pH 3.65). The plate 
is autoradiographed to identify hydrolysis products. 
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WHAT IS CLAIMED IS: 

5 1 . A purified or isolated Alloiococcus otitidis nucleic acid sequence comprising a 
nucleotide sequence selected from one of odd numbered sequences set forth 
in Seq. ID Nos: 1 to Seq. ID Nos: 105, wherein expression of said nucleic acid 
is essential for the proliferation of a cell. 

10 2. A purified or isolated nucleic acid of Alloiococcus otitidis comprising a 

fragment of one of odd numbered sequences set forth in Seq. ID Nos: 1 to 
Seq. ID Nos: 105 said fragment selected from the group consisting of 
fragments comprising at least 10, at least 20, at least 25, at least 30, at least 
50 and more than 50 consecutive nucleotides of one of one of odd numbered 

15 sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 

3. A purified or isolated antisense nucleic acid comprising a nucleotide 

sequence complementary to at least a portion of an intragenic sequence, 
intergenic sequence, sequences spanning at least a portion of two or more 
20 genes, 5' noncoding region, or 3' noncoding region within an operon 

comprising a proliferation-required gene of Alloiococcus otitidis whose activity 
or expression is inhibited by an antisense nucleic acid and selected from one 
of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 

25 4. A purified or isolated nucleic acid comprising a nucleotide sequence having at 
least 70% identity to a nucleotide sequence selected from one of odd 
numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, 
fragments comprising at least 25 consecutive nucleotides selected from one 
of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, 

30 the nucleotide sequences complementary to one of odd numbered sequences 

set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, and the sequences 
complementary to fragments comprising at least 25 consecutive nucleotides 
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of one of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID 
Nos: 105. 

A vector comprising a promoter operably linked to a nucleic acid encoding a 
polypeptide whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence of any one of odd numbered sequences 
set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 



10 5. A purified or isolated polypeptide of Alloiococcus otitidis comprising a 
polypeptide whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence of one of odd numbered sequences set 
forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, or a fragment selected from the 
group consisting of fragments comprising at least 5, at least 10, at least 20, at 

15 least 30, at least 40, at least 50, at least 60 or more than 60 consecutive 

amino acids of one of the said polypeptides. 

6. A purified or isolated Alloiococcus otitidis polypeptide comprising a amino 
acid sequence having at least 25% amino acid identity to a polypeptide 

20 whose expression is inhibited by a nucleic acid comprising a nucleotide 

sequence selected from one of odd numbered sequences set forth in Seq. ID 
Nos: 1 to Seq. ID Nos: 105, or at least 25% amino acid identity to a fragment 
comprising at least 1 0, at least 20, at least 30, at least 40, at least 50, at least 
60 or more than 60 consecutive amino acids of a polypeptide whose 

25 expression is inhibited by a nucleic acid comprising a nucleotide sequence 

selected from the group consisting of one of odd numbered sequences set 
forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 



30 7. A purified or isolated Alloiococcus otitidis polypeptide comprising selected 

from one of the even numbered sequences set forth in Seq. ID Nos: 2 to Seq. 
ID Nos: 106, wherein the polypeptide is essential for the proliferation of a cell. 
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8. A method of producing an Alloiococcus otitidis polypeptide comprising 

introducing into a cell a vector comprising a promoter operably linked to a 
nucleic acid comprising a nucleotide sequence encoding a polypeptide whose 
5 expression is essential for the proliferation and viability of Alloiococcus 

otitidis, and which is inhibited by an antisense nucleic acid, and which is 
selected from one of odd numbered sequences set forth in Seq. ID Nos: 1 to 
Seq. ID Nos: 105. 

A method of inhibiting the proliferation of Alloiococcus otitidis in an individual 
comprising inhibiting the activity or reducing the amount of a gene product 
whose expression is inhibited by an antisense nucleic acid comprising a 
nucleotide sequence selected from one of odd numbered sequences set forth 
in Seq. ID Nos: 1 to Seq. ID Nos: 105 or inhibiting the activity or reducing the 
amount of a nucleic acid encoding said gene product. 



10 



15 



10. A method for identifying a compound which influences the activity of an 
20 Alloiococcus otitidis gene product , which is required for proliferation, said 

gene product comprising a gene product whose expression is inhibited by an 
antisense nucleic acid comprising a nucleotide sequence selected from one 
of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, 
said method comprising: 

25 

(a) contacting said gene product with a candidate compound; and 

(b) determining whether said compound influences the activity of said 
gene product. 



30 11. A method for identifying a compound or an antisense nucleic acid having the 
ability to reduce activity or level of a Alloiococcus otitidis gene product, which 
is required for proliferation, said gene product comprising a gene product 
whose activity or expression is inhibited by an antisense nucleic acid 
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comprising a nucleotide sequence selected from one of odd numbered 
sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, said method 
comprising the steps of: 

(a) contacting a target gene or RNA encoding said gene product with 
5 a candidate compound or antisense nucleic acid; and 

(b) measuring the activity of said target. 

1 3. A method for inhibiting cellular proliferation of Alloiococcus otitidis comprising 
introducing an effective amount of a compound with activity against a gene 
10 whose activity or expression is essential for cellular proliferation, and which is 

inhibited by an antisense nucleic acid comprising a nucleotide sequence 
selected from one of odd numbered sequences set forth in Seq. ID Nos: 1 to 
Seq. ID Nos: 105, or a compound with activity against the product of said 
gene into a population of Alloiococcus otitidis cells expressing said gene. 



15 



13. A composition comprising an effective concentration of an antisense nucleic 
acid comprising a nucleotide sequence selected from one of odd numbered 
sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, or a proliferation- 
inhibiting portion thereof in a pharmaceutical^ acceptable carrier. 



20 



14. A method for identifying a compound having the ability to inhibit proliferation 
of Alloiococcus otitidis cell comprising: 

(a) identifying a homologue of a gene or gene product whose activity 
25 or level is inhibited by a nucleic acid comprising a nucleotide 

sequence selected from one of odd numbered sequences set forth 
in Seq. ID Nos: 1 to Seq. ID Nos: 105, in a test cell, wherein said 
test cell is not Alloiococcus otitidis; 

(a) identifying an inhibitory nucleic acid sequence which inhibits the 
30 activity of said homologue in said test cell; 

(b) contacting said test cell with a sublethal level of said inhibitory 
nucleic acid, thus sensitizing said ceil; 

(c) contacting the sensitized cell of step (c) with a compound; and 
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(d) determining the degree to which said compound inhibits 

proliferation of said sensitized cell relative to a cell which does not 
contain said inhibitory nucleic acid. 

A method for identifying a compound having activity against a biological 

pathway required for proliferation comprising: 

(a) sensitizing a cell by providing a sublethal level of an antisense 
nucleic acid complementary to a nucleic acid encoding a gene 
product required for proliferation, wherein the activity or expression 
of said gene product is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from one of odd 
numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 
1 05, in said cell to reduce the activity or amount of said gene 
product; 

(a) contacting the sensitized cell with a compound; and 

(b) determining the degree to which said compound inhibits the 
growth of said sensitized cell relative to a cell which does not 
contain said antisense nucleic acid. 

A method for identifying a compound having the ability to inhibit one of the 
Alloiococcus otitidis polypeptides encoded by a polynucleotide selected from 
one of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 
105, and which is essential for cellular proliferation comprising:. 

(a) contacting a cell which expresses the polypeptide with the 
compound; and 

(b) determining whether said compound reduces proliferation of said 
contacted cell by acting on said gene product. 

A method for identifying a compound having the ability to inhibit one of the 
purified and isolated Alloiococcus otitidis polypeptides selected from one of 
the even numbered sequences set forth in Seq. ID No.: 2 to Seq. ID No.: 106, 
and which is essential for cellular proliferation comprising: 
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(c) contacting the purified and isolated polypeptide with the compound 
in vitro in the presence or absence of a substrate, which is 
essential for the activity of the polypeptide; and 

(d) determining the effect of the compound on the polypeptide by 
measuring the effect of the polypeptide on the substrate. 

A compound which interacts with an Alloiococcus otitidis polypeptide selected 
from one of the even numbered sequences set forth in Seq. ID No.: 2 to Seq. 
ID No.: 106 and inhibits its activity. 

A method for manufacturing an antimicrobial compound comprising the steps 
of screening one or more candidate compounds to identify a compound that 
reduces the activity or level of an Alloiococcus otitidis polypeptide selected 
from one of the even numbered sequences set forth in Seq. ID No.: 2 to Seq. 
ID No.: 106, said polypeptide comprising a gene product whose activity or 
expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from one of the odd numbered sequences set forth in Seq. 
ID No.: 1 to Seq. ID No. 105; and manufacturing the compound so identified. 

A compound which inhibits proliferation of Alloiococcus otitidis by interacting 
with a gene encoding a polypeptide that is required for proliferation or with a 
polypeptide required for proliferation, wherein said polypeptide is selected 
from the group consisting of a gene product having at least 70% nucleotide 
sequence identity from one of the odd numbered sequences set forth in Seq. 
ID No.: 1 to Seq. ID No. 105, polypeptide encoded by a nucleic acid having at 
least 70% nucleotide sequence identity to a nucleic acid encoding a 
polypeptide whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from one of the odd numbered 
sequences set forth in Seq. ID No.: 1 to Seq. ID No. 105, a polypeptide 
having at least 25% amino acid identity to a gene product whose expression 
is inhibited by an antisense nucleic acid comprising a nucleotide sequence 
selected one of the odd numbered sequences set forth in Seq. ID No.: 1 to 
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Seq. ID No. 105, a polypeptide encoded by a nucleic acid comprising a 
nucleotide sequence which hybridizes to a nucleic acid selected from one of 
the odd numbered sequences set forth in Seq. ID No.: 1 to Seq. ID No. 105 
under stringent conditions, a gene product encoded by a nucleic acid 
5 comprising a nucleotide sequence which hybridizes to a nucleic acid selected 

from one of the odd numbered sequences set forth in Seq. ID No.: 1 to Seq. 
ID No. 105 under moderate conditions, and a gene product whose activity 
may be complemented by the gene product whose activity is inhibited by a 
nucleic acid selected from one of the odd numbered sequences set forth in 
10 Seq. ID No.: 1 to Seq. ID No. 105. 
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SEQUENCE LISTING 

<110> American Cyanamid Company, and Murphy, Ellen and Projan, Stephen, j 

<120> Alloiococcus otitidis Infectious Disease Targets 

* 

<13 0> Application 1 
<160> 106 

<170> Patentln version 3.1 

<210> 1 
<211> 426 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (73) . . (426) 

<223> 

<400> 1 

aagacaaaaa agaagaggga aaagatctta agacacttcc ctaagtctga acatattcta 



egg att caa get gtt tgg gac cga aag ccc age ttt gec cag egg att 
Arg He Gin Ala Val Trp Asp Arg Lys Pro Ser Phe Ala Gin Arg He 
15 20 25 

tta ace caa agg gag ttg get tat ttc gag aaa gcg act ggt agg cqq 
Leu Thr Gin Arg Glu Leu Ala Tyr Phe Glu Lys Ala Thr Gly Arg Arg 
30 35 40 



aga att gaa ttc eta gcg gga egg ttt gee ggt aaa gaa get tac agt 
Arg He Glu Phe Leu Ala Gly Arg Phe Ala Gly Lys Glu Ala Tyr Ser 

50 55 go 

aaa gee ttg gga act ggt att gga cgc ttg age ttt aaa gat att gaa 
Lys Ala Leu Gly Thr Gly He Gly Arg Leu Ser Phe Lys Asp He Glu 

65 70 75 

ate eta ate aat gac caa ggc cag cca gtc eta aca tct cat cct aaa 
lie Leu He Asn Asp Gin Gly Gin Pro Val Leu Thr Ser His Pro Lys 
80 85 90 



get ggc egg gec ttg att tea att tct cac act aga gac etc tgc ctq 
Ala Gly Arg Ala Leu He Ser He Ser His Thr Arg Asp Leu Cys Leu 
95 100 105 



gee cag gtc ctt tta cag gaa aat tga 
Ala Gin Val Leu Leu Gin Glu Asn 
HO 115 



60 



ggagggttac aa gtg att aca gga atg ggt gtg gat att gtt gaa atg age in 

Met He Thr Gly Met Gly Val Asp He Val Glu Met Ser 
1 5 10 



159 



207 



255 



303 



351 



399 



426 
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<210> 2 
<211> 117 
<212> PRT 

<213>. Alloiococcus otitidis 
<400> 2 

Met He Thr Gly Met Gly Val Asp He Val Glu Met Ser Arg He Gin 
15 10 15 



Ala Val Trp Asp Arg Lys Pro Ser Phe Ala Gin Arg He Leu Thr Gin 

20 25 30 



Arg Glu Leu Ala Tyr Phe Glu Lys Ala Thr Gly Arg Arg Arg He Glu 
35 40 45 



Phe Leu Ala Gly Arg Phe Ala Gly Lys Glu Ala Tyr Ser Lys Ala Leu 
50 55 60 



Gly Thr Gly He Gly Arg Leu Ser Phe Lys Asp He Glu He Leu He 
65 70 75 * 80 



Asn Asp Gin Gly Gin Pro Val Leu Thr Ser His Pro Lys Ala Gly Arg 

85 90 95 



Ala Leu He Ser He Ser His Thr Arg Asp Leu Cys Leu Ala Gin Val 

100 105 110 



Leu Leu Gin Glu Asn 
115 



<210> 3 
<211> 1410 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 
<221> CDS 

<222> (16) . . (1410) 
<223> 

<400> 3 

ataggagtca taacg gtg tct tgg aaa tta aaa gag att gcc cag gca gtt 51 

Met Ser Trp Lys Leu Lys Glu He Ala Gin Ala Val 

1 5 10 

ggg gga gag eta gtt agt gcg gac ggc cag gag gag gtc acc ggg gtc 99 
Gly Gly Glu Leu Val Ser Ala Asp Gly Gin Glu Glu Val Thr Gly Val 
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15 20 25 

cac ttt gat tea agg cga ctt gaa cca ggt gac ttg ttt gtt cct att 

His Phe Asp Ser Arg Arg Leu Glu Pro Gly Asp Leu Phe Val Pro He 
30 35 40 



ctg gec aag tgg cat ctt gaa get gtc gca cct atg aaa att gec ate 
Leu Ala Lys Trp His Leu Glu Ala Val Ala Pro Met Lys He Ala He 
95 100 105 



ttg tec aaa etc ttg cag cct gac att gec att ate acc atg att ggc 
Leu Ser Lys Leu Leu Gin Pro Asp He Ala He He Thr Met He Gly 
175 180 185 



147 



tta ggc cag egg gat ggt cat gat ttt gec caa gec gec eta gac caa 195 
Leu Gly Gin Arg Asp Gly His Asp Phe Ala Gin Ala Ala Leu Asp Gin 
4 5 50 55 60 

gga get age gga gec ttt tgg gec aaa gat tea age tta gee cct aaa 243 
Gly Ala Ser Gly Ala Phe Trp Ala Lys Asp Ser Ser Leu Ala Pro Lys 

65 70 75 

ggt ctt ccc ttg ate aag gta gaa gat age tac cag gee eta gtt gac 291 
Gly Leu Pro Leu He Lys Val Glu Asp Ser Tyr Gin Ala Leu Val Asp 

80 85 90 



339 



acc ggc agt aat ggg aag acc act act aag gac atg gtg get agt gtg 3 87 

Thr Gly Ser Asn Gly Lys Thr Thr Thr Lys Asp Met Val Ala Ser Val 
110 115 120 

gtg ggc caa gca ttt aag tgt cac aaa aca gtt age aac tta aat aat 435 
Val Gly Gin Ala Phe Lys Cys His Lys Thr Val Ser Asn Leu Asn Asn 
125 130 135 140 

gaa ctt ggc gtg ccc atg act ate tta get atg cct gca gac tgc cag 483 
Glu Leu Gly Val Pro Met Thr He Leu Ala Met Pro Ala Asp Cys Gin 

145 150 155 

gtc ata gtt gtt gaa atg ggc atg gat gga cca ggt cag ate teg gee 531 
Val He Val Val Glu Met Gly Met Asp Gly Pro Gly Gin He Ser Ala 

160 165 170 



579 



gag gec , cac ate gag ttc ttt ggg tea agg gac aaa att gec cag gec 627 
Glu Ala His He Glu Phe Phe Gly Ser Arg Asp Lys He Ala Gin Ala 
190 195 200 

aaa ctg gaa att eta gat ggc eta age gac cag ggc gtc ttt att gee 675 
Lys Leu Glu He Leu Asp Gly Leu Ser Asp Gin Gly Val Phe He Ala 
205 210 215 220 • 

aac ggg gat gaa ccc ctg ctt gag tct gee ttg aac cac cac ccc cac 723 
Asn Gly Asp Glu Pro Leu Leu Glu Ser Ala Leu Asn His His Pro His 

225 230 235 

age ctg cgt ttt ggc caa teg ccc cac aat gac att tat cct ttg acc 771 
Ser Leu Arg Phe Gly Gin Ser Pro His Asn Asp He Tyr Pro Leu Thr 

240 . 245 250 
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act gag att gga cag egg caa age cag ttc acc ctt aac ctg gac cct 819 
Thr Glu lie Gly Gin Arg Gin Ser Gin Phe Thr Leu Asn Leu Asp Pro 
255 260 265 

agt ctg caa ttt acc ate cct tea cca gga aaa tat aat gtc att aac 867 
Ser Leu Gin Phe Thr lie Pro Ser Pro Gly Lys Tyr Asn Val lie Asn 
270 275 280 

gee eta get gca gtc ttg gta gee cag gtc ctg gac ttg gac etc caa 915 
Ala Leu Ala Ala Val Leu Val Ala Gin Val Leu Asp Leu Asp Leu Gin 
285 290 295 300 

eta get gtc cag ggc ttg gee cag ttt cag eta age aaa aac egg ctg '963 
Leu Ala Val Gin Gly Leu Ala Gin Phe Gin Leu Ser Lys Asn Arg Leu 

305 310 315 

gaa tgg eta aaa ggc tat aag cag gee cac tta tta aat gat get tac 1011 
Glu Trp Leu Lys Gly Tyr Lys Gin Ala His Leu Leu Asn Asp Ala Tyr 

320 325 330 

aat get agt ccc act tec atg aag gcg gtc ttg gat tat ttc age cat 1059 
Asn Ala Ser Pro Thr Ser Met Lys Ala Val Leu Asp Tyr Phe Ser His 
335 340 345 

ttg gac eta gat ggg gag aag ata gcg gtt tta ggg gac ttg egg gag 1107 
Leu Asp Leu Asp Gly Glu Lys lie Ala Val Leu Gly Asp Leu Arg Glu 
350 355 360 

tta ggg tct ttg tec ggt caa etc cac egg tea ctt agt caa gee ate • 1155 
Leu Gly Ser Leu Ser Gly Gin Leu His Arg Ser Leu Ser Gin Ala He 
365 370 375 380 

gac ccc aaa ctt tta gac egg gtt gtc tta tat gga cca gaa atg gca 1203 
Asp Pro Lys Leu Leu Asp Arg Val Val Leu Tyr Gly Pro Glu Met Ala 

385 390 395 

gee etc tac cag gtc ttg aag get gat ttt gat cct gac cac ttg act 12 51 

Ala Leu Tyr Gin Val Leu Lys Ala Asp Phe Asp Pro Asp His Leu Thr 

400 405 410 

tat ttc cca gag gat cga aaa gec ttg acc gac ttt tta aaa gaa ate 1299 
Tyr Phe Pro Glu Asp Arg Lys Ala Leu Thr Asp Phe Leu Lys Glu He 
415 420 425 

atg ggc cca tct tct tat ctt ttg ttg aag tec agt eta gga aca ggt 1347 
Met Gly Pro Ser Ser Tyr Leu Leu Leu Lys Ser Ser Leu Gly Thr Gly 
430 435 440 

ctg ctt gaa gtg gtc caa gec eta agt caa aaa gaa gat gat gaa aac 1395 
Leu Leu Glu Val Val Gin Ala Leu Ser Gin Lys Glu Asp Asp Glu Asn 
445 450 455 460 

cag ccc ctg gac taa 1410 
Gin Pro Leu Asp 
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<210> 4 
<211> 464 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 4 

Met Ser Trp Lys Leu Lys Glu lie Ala Gin Ala Val Gly Gly Glu Leu 
15 10 15 



Val Ser Ala Asp Gly Gin Glu Glu Val Thr Gly Val His Phe Asp Ser 

20 25 30 



Arg Arg Leu Glu Pro Gly Asp Leu Phe Val Pro lie Leu Gly Gin Arg 
35 40 45 



Asp Gly His Asp Phe Ala Gin Ala Ala Leu Asp Gin Gly Ala Ser Gly 
50 55 60 



Ala Phe Trp Ala Lys Asp Ser Ser Leu Ala Pro Lys Gly Leu Pro Leu 
65 70 75 80 



lie Lys Val Glu Asp Ser Tyr Gin Ala Leu Val Asp Leu Ala Lys Trp 

85 90 95 



His Leu Glu Ala Val Ala Pro Met Lys lie Ala lie Thr Gly Ser Asn 

100 105 110 



Gly Lys Thr Thr Thr Lys Asp Met Val Ala Ser Val Val Gly Gin Ala 
115 120 125 



Phe Lys Cys His Lys Thr Val Ser Asn Leu Asn Asn Glu Leu Gly Val 
130 135 140 



Pro Met Thr lie Leu Ala Met Pro Ala Asp Cys Gin Val lie Val Val 
145 150 155 160 



Glu Met Gly Met Asp Gly Pro Gly Gin lie Ser Ala Leu Ser Lys Leu 

165 170 175 



Leu Gin Pro Asp He Ala He Xle Thr Met He Gly Glu Ala His, lie 

180 185 190 



Glu Phe Phe Gly Ser Arg Asp Lys He Ala Gin Ala Lys Leu Glu He 
195 200 205 
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Leu Asp Gly Leu Ser Asp Gin Gly Val Phe lie Ala Asn Gly Asp Glu 
210 215 220 



Pro Leu Leu Glu Ser Ala Leu Asn His His Pro His Ser Leu Arg Phe 
225 230 235 240 



Gly Gin Ser Pro His Asn Asp lie Tyr Pro Leu Thr Thr Glu lie Gly 

245 250 255 



Gin Arg Gin Ser Gin Phe Thr Leu Asn Leu Asp Pro Ser Leu Gin Phe 

260 265 270 



Thr lie Pro Ser Pro Gly Lys Tyr Asn Val He Asn Ala Leu Ala Ala 
275 280 285 



Val Leu Val Ala Gin Val Leu Asp Leu Asp Leu Gin Leu Ala Val Gin 
290 295 300 



Gly Leu Ala Gin Phe Gin Leu Ser Lys Asn Arg Leu Glu Trp Leu Lys 
305 310 315 320 



Gly Tyr Lys Gin Ala His Leu Leu Asn Asp Ala Tyr Asn Ala Ser Pro 

325 330 335 



Thr Ser Met Lys Ala Val Leu Asp Tyr Phe Ser His Leu Asp Leu Asp 

340 345 350 



Gly Glu Lys He Ala Val Leu Gly Asp Leu Arg Glu Leu Gly Ser Leu 
355 360 365 



Ser Gly Gin Leu His Arg Ser Leu Ser Gin Ala He Asp Pro Lys Leu 
370 375 380 



Leu Asp Arg Val Val Leu Tyr Gly Pro Glu Met Ala Ala Leu Tyr Gin 
385 390 395 400 



Val Leu Lys Ala Asp Phe Asp Pro Asp His Leu Thr Tyr Phe Pro Glu 

405 410 415 



Asp Arg Lys Ala Leu Thr Asp Phe Leu Lys Glu He Met Gly Pro Ser 

420 425 430 
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Ser Tyr Leu Leu Leu Lys Ser Ser Leu Gly Thr Gly Leu Leu Glu Val 
435 440 445 

Val Gin Ala Leu Ser Gin Lys Glu Asp Asp Glu Asn Gin Pro Leu Asp 
450 455 4 6 o 



<210> 5 
<211> 1284 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (7) . . (1284) 

<223> 



<400> 5 

gattgg atg aat ata atg aaa aaa eta ate ate aac ggt ggc egg aec 48 

Met Asn lie Met Lys Lys Leu He He Asn Gly Gly Arg Thr 
1 5 10 

etc aag ggt gaa gtc acg gta tea ggg gee aaa aat agt acg gtg get 96 
Leu Lys Gly Glu Val Thr Val Ser Gly Ala Lys Asn Ser Thx- Val Ala 
15 20 



25 30 



Leu He Pro Ala Ser t"* V° ^ CC9 *=* * tC Cta gag 999 144 

^eu xxe Pro Ala Ser He Leu Ala Asp Ser Pro Val He Leu Glu Gly 



35 40 



45 



v«f ? St « C C ? 9 gat 9tt Cat tcc cta ct 9 W at t tta aat gaa 192 

Val Pro Asp He Gin Asp Val His Ser Leu Leu Glu He Leu Asn Glu 



50 55 



60 

240 



atg aat gtc aag aec gac ttt gac gga aac act ttg ace att gac cca 
Met Asn Val Lys Thr Asp Phe Asp Gly Asn Thr Leu Thr He Asp Pro 
65 70 75 

a™ rt* J*? tCt atC CCC atg CCa a S fc fiWt aa 9 a tc caa age ttg 288 

Arg Glu Met Val Ser He Pro Met Pro Ser Gly Lys He Gin Ser Leu 



c CC taC tt:t atg gga gCC ctc S cc aaa ttc ggt aaa 336 

Arg Ala Ser Tyr Tyr Phe Met Gly Ala Leu Leu Ala Lys Phe Gly Lys 



100 105 



110 



G?v V^T Ctt CCC ^ tgC " C Ctg ggg cca cga ccc 

Gly Val Val Gly Leu Pro Gly Gly Cys Phe Leu Gly Pro Arg Pro lie 

115 120 



384 



125 



So Gin H?! r tff 2?° S° ^ C ° tg Ctt g9a gCa gat gtg gat aat 

Asp Gin His Leu Lys Gly Phe Arg Leu Leu Gly Ala Asp Val Asp Asn 

130 135 140 



432 
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gaa atg ggg gcc atg tac ctt aaa acc agt gat tea ggc eta gtg ggt 480 
Glu Met Gly Ala Met Tyr Leu Lys Thr Ser Asp Ser Gly Leu Val Gly 
145 150 155 

agt egg att tac tta gat gtt gtt teg att ggt gca acc att aat ate 528 
Ser Arg lie Tyr Leu Asp Val Val Ser lie Gly Ala Thr He Asn He 
160 165 170 

atg tta gcc get gtt agg gcc caa ggt egg acg gtc att gag aat gcg 576 
Met Leu Ala Ala Val Arg Ala Gin Gly Arg Thr Val He Glu Asn Ala 
175 180 185 190 

gcc cga gaa cca gaa att att gat gtt gcc acc etc ttg aac aag atg 624 
Ala Arg Glu Pro Glu He He Asp Val Ala Thr Leu Leu Asn Lys Met 

195 200 205 

ggg get aaa ata cgt ggg get ggc act gat atg ate egg att gaa ggg 672 

Gly Ala Lys He Arg Gly Ala Gly Thr Asp Met He Arg He Glu Gly 

210 215 220 

) - 

gtt gac cag ctg act ggc tgc cag cac tec ate ate ccc gac egg att 72 0 

Val Asp Gin Leu Thr Gly Cys Gin His Ser He He Pro Asp Arg lie 
225 230 235 

gaa get ggg acc tac ctg get att gca gcg gca get ggg gag gat gtc 7 68 

Glu Ala Gly Thr Tyr Leu Ala He Ala Ala Ala Ala Gly Glu Asp Val 
240 245 250 

ctg gta aac aat gtt ata gtt gaa cat att gat agt tta att gcc aaa 816 
Leu Val Asn Asn Val He Val Glu His He Asp Ser Leu He Ala Lys 
255 260 265 270 

etc gac gaa att ggt att gac ctg gac ate ggc gaa gac agt ate egg 864 
Leu Asp Glu He Gly He Asp Leu Asp He Gly Glu Asp Ser He Arg 

275 280 285 

gtg aaa gcc ccc agt aaa cct ttg cag cct gtt acc ate aaa acc ctg 912 
Val Lys Ala Pro Ser Lys Pro Leu Gin Pro Val Thr He Lys Thr Leu 

290 295 300 

cct tac cct ggt ttt gcc act gac etc cag cag ccc ate acc cct etc 960 
Pro Tyr Pro Gly Phe Ala Thr Asp Leu Gin Gin Pro He Thr Pro Leu 
305 310 315 

ttg ctt ctg gcc aaa ggg gag tec gtt ate acc gat acc ate tat cct 1008 
Leu Leu Leu Ala Lys Gly Glu Ser Val He Thr Asp Thr He Tyr Pro 
320 325 330 

aaa egg gtt aag cac ate cct gag ctg gaa egg atg ggg gcc aat ate 1056 
Lys Arg Val Lys His He Pro Glu Leu Glu Arg Met Gly Ala Asn He 
335 340 345 350 

egg gtc gaa age gat ate ate etc att gaa ggt ggc cac ccc etc aag 1104 
Arg Val Glu Ser Asp He He Leu He Glu Gly Gly His Pro Leu Lys 

355 360 365 
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ggg gca gaa gtg gaa gcc agt gat tta aga gcc ggg get tgc ttg att 1152 
Gly Ala Glu Val Glu Ala Ser Asp Leu Arg Ala Gly Ala Cys Leu He 

370 375 380 

aat gca ggt ttg ate gcg gaa ggt cag acg gaa att act ggc gtt gac 1200 
Asn Ala Gly Leu He Ala Glu Gly Gin Thr Glu He Thr Gly Val Asp 
385 * 390 395 

aaa att eta aga ggc tac tct cat att gtt gaa aaa etc aat gac eta 1248 
Lys He Leu Arg Gly Tyr Ser His He Val Glu Lys Leu Asn Asp Leu 
400 405 410 

ggc gca gat gtt tat atg caa gag ggg gaa gac tga 1284 
Gly Ala Asp Val Tyr Met Gin Glu Gly Glu Asp 
415 420 425 



<210> 6- 
<211> 425 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 6 

Met Asn He Met Lys Lys Leu He He Asn Gly Gly Arg Thr Leu Lys 
15 10 15 



Gly Glu Val Thr Val Ser Gly Ala Lys Asn Ser Thr Val Ala Leu He 

20 25 30 



Pro Ala Ser He Leu Ala Asp Ser Pro Val He Leu Glu Gly Val Pro 
35 40 45 



Asp He Gin Asp Val His Ser Leu Leu Glu He Leu Asn Glu Met Asn 
50 55 60 



Val Lys Thr Asp Phe Asp Gly Asn Thr Leu Thr He Asp Pro Arg Glu 
65 70 75 80 



Met Val Ser He Pro Met Pro Ser Gly Lys He Gin Ser Leu Arg Ala 

85 90 95 



Ser Tyr Tyr Phe Met Gly Ala Leu Leu Ala Lys Phe Gly Lys Gly Val 

100 105 110 



Val Gly Leu Pro Gly Gly Cys Phe Leu Gly Pro Arg Pro He Asp Gin 
115 120 125 



His Leu Lys Gly Phe Arg Leu Leu Gly Ala Asp Val Asp Asn Glu Met 
130 135 140 
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Gly Ala Met Tyr Leu Lys Thr Ser Asp Ser Gly Leu Val Gly Ser Arg 
145 150 155 160 



lie Tyr Leu Asp Val Val Ser lie Gly Ala Thr lie Asn lie Met Leu 

165 170 175 



Ala Ala Val Arg Ala Gin Gly Arg Thr Val lie Glu Asn Ala Ala Arg 

180 185 190 



Glu Pro Glu lie lie Asp Val Ala Thr Leu Leu Asn Lys Met Gly Ala 
195 200 205 



Lys lie Arg Gly Ala Gly Thr Asp Met He Arg He Glu Gly Val Asp 
210 215 220 



Gin Leu Thr Gly Cys Gin His Ser He He Pro Asp Arg He Glu Ala 
225 230 235 240 



Gly Thr Tyr Leu Ala lie Ala Ala Ala Ala Gly Glu Asp Val Leu Val 

245 250 255 



Asn Asn Val He Val Glu His He Asp Ser Leu He Ala Lys Leu Asp 

260 265 270 



Glu He Gly He Asp Leu Asp He Gly Glu Asp Ser He Arg Val Lys 
275 280 285 



Ala Pro Ser Lys Pro Leu Gin Pro Val Thr He Lys Thr Leu Pro Tyr 
290 295 300 



Pro Gly Phe Ala Thr Asp Leu Gin Gin Pro He Thr Pro Leu Leu Leu 
305 310 315 320 



Leu Ala Lys Gly Glu Ser Val He Thr Asp Thr He Tyr Pro Lys Arg 

325 330 335 



Val Lys His He Pro Glu Leu Glu Arg Met Gly Ala Asn He Arg Val 

340 345 350 



Glu Ser Asp He He Leu He Glu Gly Gly His Pro Leu Lys Gly Ala 
355 360 365 
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Glu Val Glu Ala Ser Asp Leu Arg Ala Gly Ala Cys Leu lie Asn Ala 
370 375 380 



Gly Leu He Ala Glu Gly Gin Thr Glu He Thr Gly Val Asp Lys He 
385 390. 395 400 



Leu Arg Gly Tyr Ser His He Val Glu Lys Leu Asn Asp Leu Gly Ala 

405 410 415 



Asp Val Tyr Met Gin Glu Gly Glu Asp 

420 425 



<210> 7 
<211> 612 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (4) . . (612) 

<223> 

<400> 7 

ctt ttg cat aga caa gac ttg aat cgt gaa agg aag tea gat gtg gaa 48 

Met His Arg Gin Asp Leu Asn Arg Glu Arg Lys Ser Asp Val Glu 
15 10 15 

tta aaa gag ttt gat gga aag aaa aaa gaa gaa eta gec atg att gat 96 
Leu Lys Glu Phe Asp Gly Lys Lys Lys Glu Glu Leu Ala Met He Asp 

20 25 30 

r 

gtg gec aag gec att tta gac cag gtc cat gac ttg atg cac ttc aac 144 

Val Ala Lys Ala He Leu Asp Gin Val His Asp Leu Met His Phe Asn 

35 40 45 

gac etc ttg agt gaa gtg tct gaa tat eta. gac .ttg tea gat gac gag 192 
Asp Leu Leu Ser Glu Val Ser Glu Tyr Leu Asp Leu Ser Asp Asp Glu 
50 55 60 

ate gaa age ggt atg ggc caa ttt tac acc gat tta aat att gac ggt 240 
He Glu Ser Gly Met Gly Gin Phe Tyr Thr Asp Leu Asn He Asp Gly 
65 70 75 

cgc ttc ate tct tta ggc gac aac cat tgg ggc tta cgt gaa tgg tat 288 
Arg Phe He Ser Leu Gly Asp Asn His Trp Gly Leu Arg Glu Trp Tyr 
80 85 90 95 

cca gtc gat tct ate gat gaa gag ttg acc cac gac aat gac ctg gag 33 6 

Pro Val Asp Ser He Asp Glu Glu Leu Thr His Asp Asn Asp Leu Glu 

100 105 110 



aag gtc aca ccc aag cag gcg gaa gac ggc ttt gat gac tta gag cat 



384 
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Lys Val Thr Pro Lys Gin Ala Glu Asp Gly Phe Asp Asp Leu Glu His 

115 120 125 

gtc gaa aaa gaa gtg atg gat gac gca aaa gaa gaa tta gat gac cag 
Val Glu Lys Glu Val Met Asp Asp Ala Lys Glu Glu Leu Asp Asp Gin 
130 135 140 

gcc gtc aat gaa gat gaa gaa aat gtt get cca gat gaa ate acc gac 
Ala Val Asn Glu Asp Glu Glu Asn Val Ala Pro Asp Glu lie Thr Asp 
145 150 155 

gat gga gat gaa gac aag ctg gat gaa tac tct age gat ate gaa gac 
Asp Gly Asp Glu Asp Lys Leu Asp Glu Tyr Ser Ser Asp He Glu Asp 
160 165 170 175 

etc gaa gat gat cgt aag get age caa gac aag ctg tec att gtt gac 
Leu Glu Asp Asp Arg Lys Ala Ser Gin Asp Lys Leu Ser He Val Asp 

180 185 190 

gac gaa gat gtc tta aca aat gat gac gat gag taa 
Asp. Glu Asp Val Leu Thr Asn Asp Asp Asp Glu 

195 200 



<210> 8 
<211> 202 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 8 

Met His Arg Gin Asp Leu Asn Arg Glu Arg Lys Ser Asp Val Glu Leu 
1 5 10 15 



Lys Glu Phe Asp Gly Lys Lys Lys Glu Glu Leu Ala Met He Asp Val 

20 25 30 



Ala Lys Ala He Leu Asp Gin Val His Asp Leu Met His Phe Asn Asp 
35 40 45 



Leu Leu Ser Glu Val Ser Glu Tyr Leu Asp Leu Ser Asp Asp Glu He 
50 55 60 



Glu Ser Gly Met Gly Gin Phe Tyr Thr Asp Leu Asn He Asp Gly Arg 
65 70 75 80 



Phe He Ser Leu Gly Asp Asn His Trp Gly Leu Arg Glu Trp Tyr Pro 

85 90 95 



432 



480 



528 



576 



612 



Val Asp Ser He Asp Glu Glu Leu Thr His Asp Asn Asp Leu Glu Lys 

100 105 110 
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Val Thr Pro Lys Gin Ala Glu Asp Gly Phe Asp Asp Leu Glu His Val 
115 120 125 



Glu Lys Glu Val Met Asp Asp Ala Lys Glu Glu Leu Asp Asp Gin Ala 
130 135 140 



Val Asn Glu Asp Glu Glu Asn Val Ala Pro Asp Glu lie Thr Asp Asp 
145 150 155 160 



Gly Asp Glu Asp Lys Leu Asp Glu Tyr Ser Ser Asp He Glu Asp Leu 

165 170 175 



Glu Asp Asp Arg Lys Ala Ser Gin Asp Lys Leu Ser He Val Asp Asp 

180 185 190 



Glu Asp Val Leu Thr Asn Asp Asp Asp Glu 
195 200 



<210> 9 
<211> 942 
<212> DMA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (1) . . (942) 

<223> 

<400> 9 

atg ate gaa att gaa aag cca gta att gaa aca gta gag ate agt gaa 48 

Met He Glu He Glu Lys Pro Val He Glu Thr Val Glu He Ser Glu 
1 5 10 15 

gat ggc aaa ttc ggt aag ttt gtt gtt gaa cca ttg gaa cgt ggt tat 96 
Asp Gly Lys Phe Gly Lys Phe Val Val Glu Pro Leu Glu Arg Gly Tyr 

20 25 30 

ggg act acc tta ggg aat tec tta cgc cgc ate tta tta tea tea eta 144 
Gly Thr Thr Leu Gly Asn Ser Leu Arg Arg He Leu Leu Ser Ser Leu 
35 40 45 



ccg ggt get gcg gtc acc aat att caa att gat ggt gtt ttg cat gag 
Pro Gly Ala Ala Val Thr Asn He Gin He Asp Gly Val Leu His Glu 
50 55 60 



192 



ttt aca get att gat ggt gtg gtt gaa gat gtg act tec ate ate tta 240 
Phe Thr Ala He Asp Gly Val Val Glu Asp Val Thr Ser He lie Leu 
65 70 75 80 

aac ctg aaa aaa ctg get tta aaa ctt cat act gaa gaa aca aaa aca 288 
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Asn Leu Lys Lys Leu Ala Leu Lys Leu His Thr Glu Glu Thr Lys Thr 

85 90 95 

att gaa ttg gat att gaa ggc cct get gaa gtg aca gca get gat att 
He Glu Leu Asp He Glu Gly Pro Ala Glu Val Thr Ala Ala Asp lie 

100 105 110 

att act gat agt gat gtt gag att atg aat cca gac eta tac ttg tgt 
He Thr Asp Ser Asp Val Glu He Met Asn Pro Asp Leu Tyr Leu Cys 
115 120 125 

act gtt tct gaa ggt ggt cat tta cac ate egg atg gaa gca gaa act 
Thr Val Ser Glu Gly Gly His Leu His He Arg Met Glu Ala Glu Thr 
130 135 140 

ggt aga ggt tat gtg aat gca gag cac aac aag cat gat gat atg cca 
Gly Arg Gly Tyr Val Asn Ala Glu His Asn Lys His Asp Asp Met Pro 
145 150 ' 155 160 

ate ggt gtt ttg cca att gat tea att tat acc cca att age cgt gtc 
He Gly Val Leu Pro He Asp Ser He Tyr Thr Pro He Ser Arg Val 

165 170 175 

aac tat act gtt gaa gac acc cgc gtt ggt gaa cgc gag caa tat gat 
Asn Tyr Thr Val Glu Asp Thr Arg Val Gly Glu Arg Glu Gin Tyr Asp 

180 185 190 

aag tta acc ctg gat att tgg aca gat gga tec ate tec cca gag gat 
Lys Leu Thr Leu Asp He Trp Thr Asp Gly Ser He Ser Pro Glu Asp 
195 200 205 

ggc ttg agt eta gcg get aag ate atg aat gaa cac ttg aac ate ttc 
Gly Leu Ser Leu Ala Ala Lys He Met Asn Glu His Leu Asn He Phe 
210 215 220 

ate aac tta act gag caa gca cgt gaa gcg gac att atg gtt gaa aaa 
He Asn Leu Thr Glu Gin Ala Arg Glu Ala Asp He Met Val Glu Lys 
225 230 235 240 

gaa gaa gac cag aaa gaa aaa atg ctt gag atg acc ate gaa gag ctt 
Glu Glu Asp Gin Lys Glu Lys Met Leu Glu Met .Thr He Glu Glu Leu 

245 250 255 

gat tta tct gtt egg tct tac aac tgt ttg aaa cgt get ggc ate aat 
Asp Leu Ser Val Arg Ser Tyr Asn Cys Leu Lys Arg Ala Gly lie Asn 

260 265 270 

act gtc caa gaa eta acg gac aaa act gaa ccg gaa atg atg aaa gtt 
Thr Val Gin Glu Leu Thr Asp Lys Thr Glu Pro Glu Met Met Lys Val 
275 280 285 

cgc aat etc gga cgt aag tea tta gaa gaa gtt aaa aac aag ctt gat 
Arg Asn Leu Gly Arg Lys Ser Leu Glu Glu Val Lys Asn Lys Leu Asp 
290 295 300 



gac tta gac eta age ttg aaa gaa gaa tag 
Asp Leu Asp Leu Ser Leu Lys Glu Glu 
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305 310 



<210> 10 
<211> 313 
<212> PRT 

<213> Alloiococcus otitidis 



<400> 10 

Met lie Glu He Glu Lys Pro Val He Glu Thr Val Glu He Ser Glu 
15 10 15 



Asp Gly Lys Phe Gly Lys Phe Val Val Glu Pro Leu Glu Arg Gly Tyr 

20 25 30 



Gly Thr Thr Leu Gly Asn Ser Leu Arg Arg He Leu Leu Ser Ser Leu 
35 40 45 



Pro Gly Ala Ala Val Thr Asn He Gin He Asp Gly Val Leu His Glu 
50 55 60 



Phe Thr Ala He Asp Gly Val Val Glu Asp Val Thr Ser lie He Leu 
65 70 75 80 



Asn Leu Lys Lys Leu Ala Leu Lys Leu His Thr Glu Glu Thr Lys Thr 

85 90 95 



He Glu Leu Asp He Glu Gly Pro Ala Glu Val Thr Ala Ala Asp He 

100 105 110 



He Thr Asp Ser Asp Val Glu He Met Asn Pro Asp Leu Tyr Leu Cys 
115 120 125 



Thr Val Ser Glu Gly Gly His Leu His He Arg Met Glu Ala Glu Ttrr 
130 135 140 



Gly Arg Gly Tyr Val Asn Ala Glu His Asn Lys His Asp Asp Met Pro 
145 150 155 160 



He Gly Val Leu Pro He Asp Ser He Tyr Thr Pro He Ser Arg Val 

165 170 175 



Asn Tyr Thr Val Glu Asp Thr Arg Val Gly Glu Arg Glu Gin Tyr Asp 

180 185 190 
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Lys Leu Thr Leu Asp He Trp Thr Asp Gly Ser He Ser Pro Glu Asp 
195 200 205 



Gly Leu Ser Leu Ala Ala Lys He Met Asn Glu His Leu Asn He Phe 
210 215 220 



He Asn Leu Thr Glu Gin Ala Arg Glu Ala Asp He Met Val Glu Lys 
225 230 235 240 



Glu Glu Asp Gin Lys Glu Lys Met Leu Glu Met Thr He Glu Glu Leu 

245 250 255 



Asp Leu Ser Val Arg Ser Tyr Asn Cys Leu Lys Arg Ala Gly He Asn 

260 265 270 



Thr Val Gin Glu Leu Thr Asp Lys Thr Glu Pro Glu Met Met Lys Val 
275 280 285 



Arg Asn Leu Gly Arg Lys Ser Leu Glu Glu Val Lys Asn Lys Leu Asp 
290 295 300 



Asp Leu Asp Leu Ser Leu Lys Glu Glu 
305 310 



<210> 11 
<211> 3681 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (22) . . (3681) 

<223> 

<400> 11 

aataaaggga ggtttgcccc c ttg gta gat gta aat aat ttt gaa agt att 51 

Met Val Asp Val Asn Asn Phe Glu Ser He 

1 5 10 

caa att gga ctg get tea cca gag aaa ate cgt tea tgg tct cat ggt 99 
Gin He Gly Leu Ala Ser Pro Glu Lys He Arg Ser Trp Ser His Gly 

15 20 25 

gaa gtg aag aaa cct gaa ace att aac tac egg aca tta aaa cct gaa 147 
Glu Val Lys Lys Pro Glu Thr He Asn Tyr Arg Thr Leu Lys Pro Glu 

30 35 40 



aaa gac ggt ttg ttc tgc gaa cgc att ttt ggc cca ace aag gac tat 195 
Lys Asp Gly Leu Phe Cys Glu Arg He Phe Gly Pro Thr Lys Asp Tyr 
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45 50 55 

gaa tgt get tgc gga aaa tat aaa cga gtc cac tat aaa ggg ata gtt 243 

Glu Cys Ala Cys Gly Lys Tyr Lys Arg Val His Tyr Lys Gly He Val 
60 65 70 



tgt gac cgt tgc ggt gtt gaa gtc acc aag teg agt gtc aga cga gaa 
Cys Asp Arg Cys Gly Val Glu Val Thr Lys Ser Ser Val Arg Arg Glu 
75 80 85 90 



ttc aag ggt att cca agt egg atg ggc ctt ate tta gat atg age cca 
Phe Lys Gly He Pro Ser Arg Met Gly Leu He Leu Asp Met Ser Pro 

110 115 120 

aga tec ttg gaa gaa att ate tat ttt gee tct tat gtt gtt att gac 
Arg Ser Leu Glu Glu He He Tyr Phe Ala Ser Tyr Val Val He Asp 
125 130 135 



291 



cgc atg ggc cac ttg gaa tta gca get cct gtc acc cac att tgg tac 339 
Arg Met Gly His Leu Glu Leu Ala Ala Pro Val Thr His He Trp Tyr 

95 100 105 



387 



435 



ggt ggg gat acc ccg ctt gaa cgc aaa cag etc tta act gaa cgt gaa 483 
Gly Gly Asp Thr Pro Leu Glu Arg Lys Gin Leu Leu Thr Glu Arg Glu 
140 145 150 

tac egg gaa aac aaa age aag tac ggc aat gaa ttc caa get gaa att 531 
Tyr Arg Glu Asn Lys Ser Lys Tyr Gly Asn Glu Phe Gin Ala Glu He 
155 160 165 170 

gga get gaa get gtt egg acc ttg eta aaa aat gtc gat ttg gaa caa 579 
Gly Ala Glu Ala Val Arg Thr Leu Leu Lys Asn Val Asp Leu Glu Gin 

175 180 185 

gaa gtt get gac etc aaa gaa ate tta gaa act gca act ggc caa aaa 627 
Glu Val Ala Asp Leu Lys Glu He Leu Glu Thr Ala Thr Gly Gin Lys 

190 195 200 

egg acc egg get att cgt cgt tta gac att att gac tec ttc aag tct 675 
Arg Thr Arg Ala He Arg Arg Leu Asp He He Asp Ser Phe Lys Ser 
205 210 215 

tec aac aac aaa ccg gaa tgg atg gtc ttg gat get att cca att ate 723 
Ser Asn Asn Lys Pro Glu Trp Met Val Leu Asp Ala He Pro He Ile- 
220 225 230 

cca cct gaa etc cgc cca atg gta caa eta gaa ggt ggc egg ttt gca 771 
Pro Pro Glu Leu Arg Pro Met Val Gin Leu Glu Gly Gly Arg Phe Ala 
235 240 245 250 

acc age gac ttg aac gac ttg tac cgc egg gtg att aac egg aac aac 819 
Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val He Asn Arg Asn Asn 

255 260 265 

egg ttg aaa cgc ttg ctt gac ttg aat gec ccc cac att ate gtc caa 867 
Arg Leu Lys Arg Leu Leu Asp Leu Asn Ala Pro His He He Val Gin 

270 275 280 
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aat gaa aaa egg atg ctg caa gaa get gtt gac gec ttg att gac aat 
Asn Glu Lys Arg Met Leu Gin Glu Ala Val Asp Ala Leu lie Asp Asn 
285 290 295 

ggt cgt cgc ggt egg gca gtc aac ggt cct ggt aac cgt ccg ctt aaa 
Gly Arg Arg Gly Arg Ala Val Asn Gly Pro Gly Asn Arg Pro Leu Lys 
300 305 310 

tct ctt tct cac atg ttg aaa ggg aaa caa ggg cgc ttc cgt cag aac 
Ser Leu Ser His Met Leu Lys Gly Lys Gin Gly Arg Phe Arg Gin Asn 
315 320 325 330 

eta eta ggg aaa egg gtt gac tac tct ggc egg tct gtc att gtt gtt 
Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly Arg Ser Val He Val Val 

335 340 345 

ggg cca ace ctt aaa atg tac caa tgt ggt eta ccg aaa gaa atg gec 
Gly Pro Thr Leu Lys Met Tyr Gin Cys Gly Leu Pro Lys Glu Met Ala 

350 355 360 

ate gaa etc ttc aaa cct ttt gtc atg egg gag eta gtt gag cga gat 
He Glu Leu Phe Lys Pro Phe Val Met Arg Glu Leu Val Glu Arg Asp 
365 370 375 

att gca aat aac att aaa aat gec aaa cga aaa gtg gaa egg atg gaa 
He Ala Asn Asn He Lys Asn Ala Lys Arg Lys Val Glu Arg Met Glu 
380 385 390 



915 



etc tta aac egg gec cct acc ctt cac egg eta ggg ate caa gec ttt 
Leu Leu Asn Arg Ala Pro Thr Leu His Arg Leu Gly He Gin Ala Phe 

415 420 425 

gaa cct gtc ctt gtc aat ggg aag get att cgc tta cac cca etc get 
Glu Pro t Val Leu Val Asn Gly Lys Ala He Arg Leu His Pro Leu Ala 

430 435 440 

tgt gaa gee tac aat get gac ttt gac gga gac caa atg get gtc cac 
Cys Glu Ala Tyr Asn Ala Asp Phe Asp Gly Asp Gin Met Ala Val His 
445 450 455 

gta ccc etc agt gat gaa gec cag gca gaa gee cgc ate tta atg ctg 
Val Pro Leu Ser Asp Glu Ala Gin Ala Glu Ala Arg He Leu Met Leu 
460 465 470 



cct tec caa gac atg gtc eta ggg aac tac tac eta acc atg gaa gaa 
Pro Ser Gin Asp Met Val Leu Gly Asn Tyr Tyr Leu Thr Met Glu Glu 

495 500 505 



963 



1011 



1059 



1107 



1155 



1203 



gat gat gtc tgg cct gtt tta gaa gat gtc att aaa gaa cac cct gtc 1251 
Asp Asp Val Trp Pro Val Leu Glu Asp Val He Lys Glu His Pro Val 
395 400 405 410 



1299 



1347 



1395 



1443 



ggt gec caa aat ate tta aac cct aaa gat ggt caa cca gtc gtt acc 1491 
Gly Ala Gin Asn He Leu Asn Pro Lys Asp Gly Gin Pro Val Val Thr 
475 480 485 490 



1539 
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gaa ggt aaa att ggt 
Glu Gly Lys lie Gly 

510 

ate caa gec tac caa 
lie Gin Ala Tyr Gin 
525 

ate cgt gcg gtg gac 
lie Arg Ala Val Asp 
540 

gac aag tac ttg att 
Asp Lys Tyr Leu lie 
555 

atg cca gca gaa ttt 
Met Pro Ala Glu Phe 

575 

gaa cag caa acc cca 
Glu Gin Gin Thr Pro 

590 

aaa gac ctt att gec 
Lys Asp Leu lie Ala 
605 

gac ctg tec aac att 
Asp Leu Ser Asn He 
620 

gaa acc tct aaa atg 
Glu Thr Ser Lys Met 
635 

tct acc egg tct ggt 
Ser Thr Arg Ser Gly 

655 

gaa get aaa cca gaa 
Glu Ala Lys Pro Glu 

670 

ate aat gec acc cac 
He Asn Ala Thr His 
685 

gac aac gtt ate gat 
Asp Asn Val He Asp 
700 

gee ttg atg gat tec 
Ala Leu Met Asp Ser 
715 



gaa gga act gtc ttc 
Glu Gly Thr Val Phe 

515 

aca ggc tat gtc cac 
Thr Gly Tyr Val His 
530 

tta ccg gac aaa cct 
Leu Pro Asp Lys Pro 
545 

acc aca gtc ggt aag 
Thr Thr Val Gly Lys 
560 

cca ttc ttg aac gaa 
Pro Phe Leu Asn Glu 

580 

gac aag tac ttt gtc 
Asp Lys Tyr Phe Val 

595 

gac cgt cct tta gtt 
Asp Arg Pro Leu Val 
610 

ate gec gaa gtc ttt 
He Ala Glu Val Phe 
625 

ttg gac cgc atg aag 
Leu Asp Arg Met Lys 
640 

att act gtt ggg att 
He Thr Val Gly He 

660 

ate ctg aaa gaa gec 
He Leu Lys Glu Ala 

675 

cgc cgc ggt tta att 
Arg Arg Gly Leu He 
690 

gtc tgg caa aag get 
Val Trp Gin Lys Ala 
705 

ctt gac cca aga aat 
Leu Asp Pro Arg Asn 
720 
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tec agt get tct gag 
Ser Ser Ala Ser Glu 

520 

etc cac acc egg gtt 
Leu His Thr Arg Val 
535 

ttt act gac tgg cag 
Phe Thr Asp Trp Gin 
550 

att ate ttt aat gaa 
He He Phe Asn Glu 
565 

cca tct aag gtt aac 
Pro Ser Lys Val Asn 

585 

gac egg ggc caa aac 
Asp Arg Gly Gin Asn 

600 

cag cct ttc aaa aaa 
Gin Pro Phe Lys Lys 
615 

aat aac ttc caa gtg 
Asn Asn Phe Gin Val 
630 

aac ttg ggc tac aag 
Asn Leu Gly Tyr Lys 
645 

get gac gtt tea gtc 
Ala Asp Val Ser Val 

665 

cac gee aag gtt gat 
His Ala Lys Val Asp 

680 

act gaa gaa gag cgt 
Thr Glu Glu Glu Arg 
695 

aag gat gaa att caa 
Lys Asp Glu He Gin 
710 

aac ate ttt atg atg 
Asn lie Phe Met Met 
725 



PCT/US02/36122 



get 1587 
Ala 



gcg 163 5 

Ala 



aaa 1683 
Lys 

att 1731 

He 

570 

ctg 1779 
Leu 



ttg 1827 
Leu 



caa 187 5 

Gin 



acc 1923 
Thr 



tac 1971 

Tyr 

650 

eta 2019 
Leu 



aaa 2067 
Lys 

tac 2115 
Tyr 

gat 2163 
Asp 

tea 2211 

Ser 

730 



gac tct ggt gec cgt ggg aat att tec aac ttc acc caa eta gee ggt 



2259 
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Asp Ser Gly Ala Arg Gly Asn lie Ser Asn Phe Thr Gin Leu Ala Gly 

735 740 745 

atg cgt ggt ttg atg gca gca cca agt ggt gag ate atg gaa ttg ccg 2307 
Met Arg Gly Leu Met Ala Ala Pro Ser Gly Glu lie Met Glu Leu Pro 

750 755 760 

ate acg tct aac ttc cgt gaa ggc ctg tct gtc tta gag atg ttt att 2355 
He Thr Ser Asn Phe Arg Glu Gly Leu Ser Val Leu Glu Met Phe He 
765 770 775 

tec acc cac ggt gec cgt aaa ggc atg acc gat acc gec ctt aaa act 2403 
Ser Thr His Gly Ala Arg Lys Gly Met Thr Asp Thr Ala Leu Lys Thr 
780 785 790 

gec gac tct ggt tac ttg acc aga cgt ttg gtt gat gtt gec caa gac 2451 
Ala Asp Ser Gly Tyr Leu Thr Arg Arg Leu Val Asp Val Ala Gin Asp 
795 800 805 810 

gtc ate ate cga gaa gaa gac tgt ggc act aaa cgt ggc ctt aaa gtt 2499 
Val He He Arg Glu Glu Asp Cys Gly Thr Lys Arg Gly Leu Lys Val 

815 820 825 

tct gec ate caa gta gga aat gaa cag att gaa age ttg tct gac cgt 2547 
Ser Ala He Gin Val Gly Asn Glu Gin He Glu Ser Leu Ser Asp Arg 

830 835 840 

ate ttg ggt cgt tat gee caa gaa acc gtc acc cac ccc gaa act ggt 2595 
He Leu Gly Arg Tyr Ala Gin Glu Thr Val Thr His Pro Glu Thr Gly 
845 850 855 

gaa gtc att gtt cac aag gat gaa ttg att gat gaa ggc aaa acc cga 2643 
Glu Val He Val His Lys Asp Glu Leu He Asp Glu Gly Lys Thr Arg 
860 865 870 

aaa att gtc gat gec ggt att gaa gaa gtt act ate egg tct gec ttc 2691 
Lys He Val Asp Ala Gly He Glu Glu Val Thr He Arg Ser Ala Phe 
875 880 885 890 

tgc tgc aac acc aac cac ggt gtc tgc aag cac tgc tat ggc cgt aac 2739 
Cys Cys Asn Thr Asn His Gly Val Cys Lys His Cys Tyr Gly Arg Asn 

895 900 905 

ttg gca act ggc egg gaa gtt gaa gtt ggt gaa gca gtt gga act ate 2787 
Leu Ala Thr Gly Arg Glu Val Glu Val Gly Glu Ala Val Gly Thr He 

910 915 920 

get gec caa tec att ggg gaa ccc ggt acc caa ttg acc atg egg acc 2 835 

Ala Ala Gin Ser He Gly Glu Pro Gly Thr Gin Leu Thr Met Arg Thr 
925 930 935 

ttc cac act ggt ggg gtc get ggg gac gac ate acc caa ggt eta cca 2883 
Phe His Thr Gly Gly Val Ala Gly Asp Asp He Thr Gin Gly Leu Pro 
940 945 950 

egg gtt caa gaa ate ttt gaa gec cgc cat ccg aaa ggg caa gec acc 2931 
Arg Val Gin Glu He Phe Glu Ala Arg His Pro Lys Gly Gin Ala Thr 



9n 
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955 960 965 970 

att aca gaa gtg aat ggt caa ate caa gag ate gtt gaa gac cct gaa 2979 
lie Thr Glu Val Asn Gly Gin lie Gin Glu lie Val Glu Asp Pro Glu 

975 980 985 

gaa cgc act aag acc gtc act gtt aag ggg aat gtt gac caa cgt gac 3027 
Glu Arg Thr Lys Thr Val Thr Val Lys Gly Asn Val Asp Gin Arg Asp 

990 995 10 00 

tac tec ttg cca ate aat gee egg atg aag gtt gaa gtt ggg gat tat 3075 
Tyr Ser Leu Pro lie Asn Ala Arg Met Lys Val Glu Val Gly Asp Tyr 
1005 1010 1015 

gtt gaa cga ggc gat get eta aac gag ggg tct att gat ccg aaa gag 3123 
Val Glu Arg Gly Asp Ala Leu Asn Glu Gly Ser lie Asp Pro Lys Glu 
1020 1025 1030 

tta etc gcg gtg agt gat atg atg aaa ttg cag aaa tac etc ttg caa 3171 
Leu Leu Ala Val Ser Asp Met Met Lys Leu Gin Lys Tyr Leu Leu Gin 
1035 1040 1045 1050 

gaa gtc caa tac get tac egg tct caa ggg gtc gaa att ggt gac aag 3219 
Glu Val Gin Tyr Ala Tyr Arg Ser Gin Gly Val Glu lie Gly Asp Lys 

1055 1060 1065 

cac gtg gag gtt atg gtg cga caa atg etc cgt aaa gtc cgt gtc ttg 3267 
His Val Glu Val Met Val Arg Gin Met Leu Arg Lys Val Arg Val Leu 

1070 1075 1080 

caa cca ggg gac act gat ate ctg cct ggt acc atg att gac etc cac 3315 
Gin Pro Gly Asp Thr Asp He Leu Pro Gly Thr Met He Asp Leu His 
1085 1090 1095 

gac ttc aag gaa cgc aac caa gaa acc ttg atg tec ggt ggc caa ccc 3363 
Asp Phe Lys Glu Arg Asn Gin Glu Thr Leu Met Ser Gly Gly Gin Pro 
1100 1105 1110 

gca act get aga ctg gtc eta ctg ggt att acc aag gee tec ctt gaa 3411 
Ala Thr Ala Arg Leu Val Leu Leu Gly He Thr Lys Ala Ser Leu Glu 
1115 1120 1125 1130 

acc aac tct ttc ttg tct gca get tec ttc caa gaa acc acc egg gtc 3459 
Thr Asn Ser Phe Leu Ser Ala Ala Ser Phe Gin Glu Thr Thr Arg Val 

1135 1140 1145 

etc acc gat gca get att cgc ggt aaa gtt gat gac ctg gtt ggc ttg 3507 
Leu Thr Asp Ala Ala He Arg Gly Lys Val Asp Asp Leu Val Gly Leu 

1150 1155 1160 

aaa gaa aat gtt att ate ggt aaa tec ate cca get ggt act ggt atg 3555 
Lys Glu Asn Val He He Gly Lys Ser lie Pro Ala Gly Thr Gly Met 



1165 1170 1175 

aga gee tac agt aat att gaa cct aaa aaa gtt ggt gtc gtt age gaa 
Arg Ala Tyr Ser Asn He Glu Pro Lys Lys Vai Gly Val Val Ser Glu 
1180 1185 



3603 



1190 



7.1 
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aat gtc tac age ate aat gaa gaa gac caa gtc agt caa gaa gaa aac 3 651 

Asn Val Tyr Ser lie Asn Glu Glu Asp Gin Val Ser Gin Glu Glu Asn 
1195 1200 1205 1210 

cga gaa act gaa gaa act age gag aaa taa 3681 
Arg Glu Thr Glu Glu Thr Ser Glu Lys 

1215 



<210> 12 
<211> 1219 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 12 

Met Val Asp Val Asn Asn Phe Glu Ser lie Gin lie Gly Leu Ala Ser 
15 10 15 

Pro Glu Lys lie Arg Ser Trp Ser His Gly Glu Val Lys Lys Pro Glu 

20 25 30 



Thr lie Asn Tyr Arg Thr Leu Lys Pro Glu Lys Asp Gly Leu Phe Cys 
35 40 45 



Glu Arg lie Phe Gly Pro Thr Lys Asp Tyr Glu Cys Ala Cys Gly Lys 
50 55 60 



Tyr Lys Arg Val His Tyr Lys Gly lie Val Cys Asp Arg Cys Gly Val 
65 70 75 80 



Glu Val Thr Lys Ser Ser Val Arg Arg Glu Arg Met Gly His Leu Glu 

85 90 95 



Leu Ala Ala Pro Val Thr His lie Trp Tyr Phe Lys Gly He Pro Ser 

100 105 110 



Arg Met Gly Leu He Leu Asp Met Ser Pro Arg Ser Leu Glu Glu lie 
115 120 125 



He Tyr Phe Ala Ser Tyr Val Val He Asp Gly Gly Asp Thr Pro Leu 
130 135 140 



Glu Arg Lys Gin Leu Leu Thr Glu Arg Glu Tyr Arg Glu Asn Lys Ser 
145 150 155 160 



Lys Tyr Gly Asn Glu Phe Gin Ala Glu He Gly Ala Glu Ala Val Arg 
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165 170 175 



Thr Leu Leu Lys Asn Val Asp Leu Glu Gin Glu Val Ala Asp Leu Lys 

180 185 190 



Glu He Leu Glu Thr Ala Thr Gly Gin Lys Arg Thr Arg Ala He Arg 
195 200 205 



Arg Leu Asp He He Asp Ser Phe Lys Ser Ser Asn Asn Lys Pro Glu 
210 215 220 



Trp Met Val Leu Asp Ala He Pro He He Pro Pro Glu Leu Arg Pro 
225 230 235 240 



Met Val Gin Leu Glu Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp 

245 250 255 



Leu Tyr Arg Arg Val He Asn Arg Asn Asn Arg Leu Lys Arg Leu Leu 

260 265 270 



Asp Leu Asn Ala Pro His He He Val Gin Asn Glu Lys Arg Met Leu 
275 280 285 



Gin Glu Ala Val Asp Ala Leu He Asp Asn Gly Arg Arg Gly Arg Ala 
290 295 300 



Val Asn Gly Pro Gly Asn Arg Pro Leu Lys Ser Leu Ser His Met Leu 
305 310 315 320 



Lys Gly Lys Gin Gly Arg Phe Arg Gin Asn Leu Leu Gly Lys Arg Val 

325 330 335 



Asp Tyr Ser Gly Arg Ser Val He Val Val Gly Pro Thr Leu Lys Met 

340 345 350 



Tyr Gin Cys Gly Leu Pro Lys Glu Met Ala He Glu Leu Phe Lys Pro 
355 360 365 



Phe Val Met Arg Glu Leu Val Glu Arg Asp He Ala Asn Asn He Lys 
370 375 380 



Asn Ala Lys Arg Lys Val Glu Arg Met Glu Asp Asp Val Trp Pro Val 
385 390 395 400 
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Leu Glu Asp Val lie Lys Glu His Piro Val Leu Leu Asn Arg Ala Pro 

405 410 415 



Thr Leu His Arg Leu Gly lie Gin Ala Phe Glu Pro Val Leu Val Asn 

420 425 430 



Gly Lys Ala lie Arg Leu His Pro Leu Ala Cys Glu Ala Tyr Asn Ala 
435 440 445 



Asp Phe Asp Gly Asp Gin Met Ala Val His Val Pro Leu Ser Asp Glu 
450 455 460 



Ala Gin Ala Glu Ala Arg lie Leu Met Leu Gly Ala Gin Asn He Leu 
465 470 475 480 

Asn Pro Lys Asp Gly Gin Pro Val Val Thr Pro Ser Gin Asp Met Val 

485 490 495 



Leu Gly Asn Tyr Tyr Leu Thr Met Glu Glu Glu Gly Lys He Gly Glu 

500 505 510 



Gly Thr Val Phe Ser Ser Ala Ser Glu Ala lie Gin Ala Tyr Gin Thr 
515 520 525 



Gly Tyr Val His Leu His Thr Arg Val Ala He Arg Ala Val Asp Leu 
530 535 540 



Pro Asp Lys Pro Phe Thr Asp Trp Gin Lys Asp Lys Tyr Leu He Thr 
545 550 555 560 

Thr Val Gly Lys He He Phe Asn Glu He Met Pro Ala Glu Phe Pro 

565 570 575 



Phe Leu Asn Glu Pro Ser Lys Val Asn Leu Glu Gin Gin Thr Pro Asp 

580 585 590 



Lys Tyr Phe Val Asp Arg Gly Gin Asn Leu Lys Asp Leu He Ala Asp 
595 600 605 



Arg Pro Leu Val Gin Pro Phe Lys Lys Gin Asp Leu Ser Asn He He 
610 615 620 
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Ala Glu Val Phe Asn Asn Phe Gin Val Thr Glu Thr Ser Lys Met Leu 
625 630 635 640 



Asp Arg Met Lys Asn Leu Gly Tyr Lys Tyr Ser Thr Arg Ser Gly He 

645 650 655 



Thr Val Gly He Ala Asp Val Ser Val Leu Glu Ala Lys Pro Glu He 

660 665 670 



Leu Lys Glu Ala His Ala Lys Val Asp Lys He Asn Ala Thr His Arg 
675 680 685 



Arg Gly Leu He Thr Glu Glu Glu Arg Tyr Asp Asn Val He Asp Val 
690 695 700 



Trp Gin Lys Ala Lys Asp Glu He Gin Asp Ala Leu Met Asp Ser Leu 
705 710 715 720 



Asp Pro Arg Asn Asn He Phe Met Met Ser Asp Ser Gly Ala Arg Gly 

725 730 735 



Asn He Ser Asn Phe Thr Gin Leu Ala Gly Met Arg Gly Leu Met Ala 

740 745 750 



Ala Pro Ser Gly Glu He Met Glu Leu Pro He Thr Ser Asn Phe Arg 
755 760 765 



Glu Gly Leu Ser Val Leu Glu Met Phe He Ser Thr His Gly Ala Arg 
770 775 780 



Lys Gly Met Thr Asp Thr Ala Leu Lys Thr Ala Asp Ser Gly Tyr Leu 
785 790 795 800 



Thr Arg Arg Leu Val Asp Val Ala Gin Asp Val He He Arg Glu Glu 

805 810 815 



Asp Cys Gly Thr Lys Arg Gly Leu Lys Val Ser Ala He Gin Val Gly 

820 825 830 



Asn Glu Gin He Glu Ser Leu Ser Asp Arg He Leu Gly Arg Tyr Ala 
835 840 845 
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Gin Glu Thr Val Thr His Pro Glu Thr Gly Glu Val lie Val His Lys 
850 855 860 

Asp Glu Leu lie Asp Glu Gly Lys Thr Arg Lys lie Val Asp Ala Gly 
865 870 875 880 

lie Glu Glu Val Thr lie Arg Ser Ala Phe Cys Cys Asn Thr Asn His 

885 890 895 



Gly Val Cys Lys His Cys Tyr Gly Arg Asn Leu Ala Thr Gly Arg Glu 

900 905 910 



Val Glu Val Gly Glu Ala Val Gly Thr lie Ala Ala Gin Ser He Gly 
915 920 925 



Glu. Pro Gly Thr Gin Leu Thr Met Arg Thr Phe His Thr Gly Gly Val 
930 935 940 

Ala Gly Asp Asp He Thr Gin Gly Leu Pro Arg Val Gin Glu He Phe 
945 950 955 960 

Glu Ala Arg His Pro Lys Gly Gin Ala Thr He Thr Glu Val Asn Gly 

965 970 975 



Gin He Gin Glu He Val Glu Asp Pro Glu Glu Arg Thr Lys Thr Val 

980 985 990 



Thr Val Lys Gly Asn Val Asp Gin Arg Asp Tyr Ser Leu Pro He Asn 
995 1000 1005 



Ala Arg Met Lys Val Glu Val Gly Asp Tyr Val Glu Arg Gly Asp Ala 
1010 1015 1020 



Leu Asn Glu Gly Ser He Asp Pro Lys Glu Leu Leu Ala Val Ser Asp 
1025 1030 1035 1040 



Met Met Lys Leu Gin Lys Tyr Leu Leu Gin Glu Val Gin Tyr Ala Tyr 

1045 1050 1055 



Arg Ser Gin Gly Val Glu lie Gly Asp Lys His Val Glu Val Met Val 

1060 1065 1070 



Arg Gin Met Leu Arg Lys Val Arg Val Leu Gin Pro Gly Asp Thr Asp 
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1075 1080 1085 

lie Leu Pro Gly Thr Met lie Asp Leu His Asp Phe Lys Glu Arg Asn 
1090 1095 1100 

Gin Glu Thr Leu Met Ser Gly Gly Gin Pro Ala Thr Ala Arg Leu Val 
1105 1110 1115 112° 

Leu Leu Gly He Thr Lys Ala Ser Leu Glu Thr Asn Ser Phe Leu Ser 

1125 1130 1135 



Ala Ala Ser Phe Gin Glu Thr Thr Arg Val Leu Thr Asp Ala Ala He 

1140 1145 H50 

Arg Gly Lys Val Asp Asp Leu Val Gly Leu Lys Glu Asn Val He He 
1155 1160 1165 

Gly Lys Ser He Pro Ala Gly Thr Gly Met Arg Ala Tyr Ser Asn He 
1170 1175 1180 

Glu Pro Lys Lys Val Gly Val Val Ser Glu Asn Val Tyr Ser He Asn 
1185 1150 1195 1200 

Glu Glu Asp Gin Val Ser Gin Glu Glu Asn Arg Glu Thr Glu Glu Thr 

1205 1210 1215 



Ser Glu Lys 



<210> 13 
<211> 3582 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (4) . . (3582) 

<223> 

<400> 13 

gag gtg aac aag ttg gtc ggt aaa aaa gtt aat ttt ggt aaa cac cgt 
Met Asn Lys Leu Val Gly Lys Lys Val Asn Phe Gly Lys His Arg 
1 5 10 15 

gtt cgt aga agt tac tea cga ate aac gaa gta etc gag etc ccg aat 
Val Arc Arg Ser Tyr Ser Arg He Asn Glu Val Leu Glu Leu Pro Asn 

20 25 30 



48 



96 
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tta att gaa ate cag act gat tea tat gat tgg ttt tta gat gaa ggc 
Leu lie Glu lie Gin Thr Asp Ser Tyr Asp Trp Phe Leu Asp Glu Gly 

35 40 45 



144 



ttg aag gaa atg ttt agt gat att tec cca ate gat gat ttt tea ggc 192 
Leu Lys Glu Met Phe Ser Asp He Ser Pro He Asp Asp Phe Ser Gly 
50 55 60 



aat ttg tec eta gaa ttt gtt gac tat aaa ttt tac gaa age aag tat 
Asn Leu Ser Leu Glu Phe Val Asp Tyr Lys Phe Tyr Glu Ser Lys Tyr 
65 70 75 

act gtt gaa gaa get aga gag cat gat gcg aac tat tct gee ccc etc 
Thr Val Glu Glu Ala Arg Glu His Asp Ala Asn Tyr Ser Ala Pro Leu 
80 85 90 95 



caa gaa gtc ttc ttc ggt gac ttt ccg tta atg aca gaa caa ggg ace 
Gin Glu Val Phe Phe Gly Asp Phe Pro Leu Met Thr Glu Gin Gly Thr 

115 120 125 

ttt ate ate aac ggg get gag egg gtg att gtt tec caa ctt gtc egg 
Phe He He Asn Gly Ala Glu Arg Val He Val Ser Gin Leu Val Arg 
130 135 140 

teg cct ggg gtt tat tac agt cca aaa gtt gag aaa aac ggc egg gaa 
Ser Pro Gly Val Tyr Tyr Ser Pro Lys Val Glu Lys Asn Gly Arg Glu 
145 150 155 

ggt ttt tea acc gtt ctt ate cct aac egg ggt get tgg ctt gaa tac 
Gly Phe Ser Thr Val Leu He Pro Asn Arg Gly Ala Trp Leu Glu Tyr 
160 165 170 175 

gaa aca gat acc aaa ggc ate tec aat gtt cga att gac cga acc cgt 
Glu Thr Asp Thr Lys Gly He Ser Asn Val Arg He Asp Arg Thr Arg 

180 185 190 

aaa att ccg ate act gtc ttg tta aga get eta ggg att ggg tea gat 
Lys He Pro He Thr Val Leu Leu Arg Ala Leu Gly He Gly Ser Asp 

195 200 205 

gat gaa att att gac ctg ate ggc ttg aat gac age ttg gaa gec acc 
Asp Glu He He Asp Leu He Gly Leu Asn Asp Ser Leu Glu Ala Thr 
210 215 220 

ttg gaa aag gat gtc cac aag tct act tea gat tec cgc gta gaa gaa 
Leu Glu Lys Asp Val His Lys Ser Thr Ser Asp Ser Arg Val Glu Glu 
225 230 235 

gec ttg aaa gac ttg tat gaa cgc ttg cgt cca ggt gaa ccc aaa aca 
Ala Leu Lys Asp Leu Tyr Glu Arg Leu Arg Pro Gly Glu Pro Lys Thr 
240 245 250 255 

get gaa tec tct cgt aac ttg ate aat acc egg ttc ttt gac cac aag 



240 



288 



tac gtg aag tta cgt ttg ate aac aag gaa act ggt gaa gtc aag gaa 33 6 

Tyr Val Lys Leu Arg Leu He Asn Lys Glu Thr Gly Glu Val Lys Glu 

100 105 110 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 
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Ala Glu Ser Ser Arg Asn Leu He Asn Thx Arg Phe Phe Asp His Lys 

260 265 270 

cgt tac gac eta gec tat gtt ggt cgc tac aag atg aac aaa aaa eta 
Arg Tyr Asp Leu Ala Tyr Val Gly Arg Tyr Lys Met Asn Lys Lys Leu 

275 280 285 

gac ctt aaa acc cgc ttg atg ggg act gtc ctt gec gaa aac ctg gtt 
Asp Leu Lys Thr Arg Leu Met Gly Thr Val Leu Ala Glu Asn Leu Val 
290 295 300 

gat cct gaa get ggc gag gtc tta get gaa gaa ggt agt gaa gtg acc 
Asp Pro Glu Ala Gly Glu Val Leu Ala Glu Glu Gly Ser Glu Val Thr 
305 310 315 

egg tct gtg atg gac aag ctt ggc cct tac ctt gac ggt gac atg aac 
Arg Ser Val Met Asp Lys Leu Gly Pro Tyr Leu Asp Gly Asp Met Asn 
320 325 330 335 

caa gta acc att aac ccc tea gaa gaa gcg gtt ate cct gaa ccc att 
'Gin Val Thr He Asn Pro Ser Glu Glu Ala Val He Pro Glu Pro He 

340 345 350 

gac eta caa att gtc aaa gtc tac tec aaa gaa gat cca gac egg ate 
Asp Leu Gin He Val Lys Val Tyr Ser Lys Glu Asp Pro Asp Arg He 

355 360 365 

gtg aac atg ate ggc aac ggg cac cct gac aaa aag gec aaa tgg att 
Val Asn Met He Gly Asn Gly His Pro Asp Lys Lys Ala Lys Trp He 
370 375 380 

acc cct get gac atg ata gcg get atg agt tac ttc ttt aac etc caa 
Thr Pro Ala Asp Met He Ala Ala Met Ser Tyr Phe Phe Asn Leu Gin 
385 390 395 

gaa ggc att ggc gat gtt gac gat ate gac cac ttg ggt aac cgt egg 
Glu Gly He Gly Asp Val Asp Asp He Asp His Leu Gly Asn Arg Arg 
400 405 410 415 

ate egg tea gtc gga gag ctt ttg caa aac caa ttc cga att ggg etc 
He Arg Ser Val Gly Glu Leu Leu Gin Asn Gin Phe Arg lie Gly Leu 

420 425 430 

tct egg atg gag egg gtg gtc cgc gaa cga atg tec ate caa gac att 
Ser Arg Met Glu Arg Val Val Arg Glu Arg Met Ser He Gin Asp He 

435 440 445 

tct age acc aca ccc caa caa tta att aac ate cgt ccc gtt gta get 
Ser Ser Thr Thr Pro Gin Gin Leu He Asn He Arg Pro Val Val Ala 
450 455 460 

tct ctg aaa gaa ttt ttc ggg tct tec caa etc tec caa ttc atg gac 
Ser Leu Lys Glu Phe Phe Gly Ser Ser Gin Leu Ser Gin Phe Met Asp 
465 470 475 



caa acc aac ccc ttg ggt gag tta acc cac aaa cgt cgc ttg tct gee 
Gin Thr Asn Pro Leu Gly Glu Leu Thr His Lys Arg Arg Leu Ser Ala 



WO 03/104391 



30/235 



PCT/US02/36122 



480 



485 490 495 



ctt gga cca gga ggc ttg act agg gac egg get ggt tat gaa gtc cga 153 6 

Leu Gly Pro Gly Gly Leu Thr Arg Asp Arg Ala Gly Tyr Glu Val Arg 

500 505 510 



gac gtc cac tat tec cac tac ggc egg atg tgc ccg ate gaa aca cct 
Asp Val His Tyr Ser His Tyr Gly Arg Met Cys Pro lie Glu Thr Pro 

515 520 525 



ate aat aaa ttt ggt ttt att gaa aca cct tac cgc egg gtg gac egg 
He Asn Lys Phe Gly Phe He Glu Thr Pro Tyr Arg Arg Val Asp Arg 
545 550 555 

gaa act ggc cag gta acg gat aaa att gac tac ttg act get gac gaa 
Glu Thr Gly Gin Val Thr Asp Lys He Asp Tyr Leu Thr Ala Asp Glu 
560 565 570 575 

gaa gat ctt tac gtt gta gec caa gee aat get gaa tta gat gaa gat 
Glu Asp Leu Tyr Val Val Ala Gin Ala Asn Ala Glu Leu Asp Glu Asp 

580 585 590 

gga cat ttt get aat gat gtc gtc eta gee cga aga egg gat gtc aac 
Gly His Phe Ala Asn Asp Val Val Leu Ala Arg Arg Arg Asp Val Asn 

595 600 605 

gaa gag gtt gac get tec gaa gtt gac tat atg gac gtg tea cca aaa 
Glu Glu Val Asp Ala Ser Glu Val Asp Tyr Met Asp Val Ser Pro Lys 
610 615 620 

caa gtt gtt tct gtg gec aca get tec att cct ttc tta gaa aac gac 
Gin Val Val Ser Val Ala Thr Ala Ser He Pro Phe Leu Glu Asn Asp 
625 630 635 

gac tec aac egg get eta atg ggg get aac atg caa egg caa get gtt 
Asp Ser Asn Arg Ala Leu Met Gly Ala Asn Met Gin Arg Gin Ala Val 
640 645 650 655 

cct ctt atg caa cca gag tec cca eta gta gga act gga ate gaa cac 
Pro Leu Met Gin Pro Glu Ser Pro Leu Val Gly Thr Gly He Glu His 

660 665 670 

att gca gec cgt gac tct gga get gee gtt att gee aag get gac ggg 
He Ala Ala Arg Asp Ser Gly Ala Ala Val He Ala Lys Ala Asp Gly 

675 680 685 



ggt ace etc aac aac tac aag ctg get aag tac aaa egg tec aac tec 
Gly Thr Leu Asn Asn Tyr Lys Leu Ala Lys Tyr Lys Arg Ser Asn Ser 



1584 



gaa ggc cca aac att ggt ctg att aac agt ttg tct ace tat get aag 1632 
Glu Gly Pro Asn He Gly Leu He Asn Ser Leu Ser Thr Tyr Ala Lys 
530 535 540 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



2016 



2064 



gtt gtg gag tat gtt gat gee aag acg gtc aaa gtc cgt caa gee gat 2112 
Val Val Glu Tyr Val Asp Ala Lys Thr Val Lys Val Arg Gin Ala Asp 
690 695 700 



2160 



705 710 715 
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gga act tct tac aac caa aga cca att gta aaa act ggt gag gaa gtt 
Gly Thr Ser Tyr Asn Gin Arg Pro lie Val Lys Thr Gly Glu Glu Val 
720 725 730 735 

gac aaa ggc gac ate eta gca gat ggt ccg tec atg gaa aat ggt gaa 
Asp Lys Gly Asp lie Leu Ala Asp Gly Pro Ser Met Glu Asn Gly Glu 

740 745 750 

atg gec ctt ggt aaa aac cca ttg att gec ttt acc acc ttt gat ggc 
Met Ala Leu Gly Lys Asn Pro Leu He Ala Phe Thr Thr Phe Asp Gly 

755 760 765 

tac aac ttc gag gat gec gtc att atg agt gaa cgt ttg gtc aaa gat 
Tyr Asn Phe Glu Asp Ala Val He Met Ser Glu Arg Leu Val Lys Asp 
770 775 780 

gac gtt tat acc tec ate cac att gaa gaa tat gaa tct gaa gec cgc 
Asp Val Tyr Thr Ser He His He Glu Glu Tyr Glu Ser Glu Ala Arg 
785 790 795 

gat acc aag tta ggg cca gaa gaa ate acc egg gaa att cca aac gtc 
Asp Thr Lys Leu Gly Pro Glu Glu He Thr Arg Glu He Pro Asn Val 
800 805 810 815 

ggg gaa agt gec etc aag aac ttg gat gaa aga ggc att ate egg ate 
Gly Glu Ser Ala Leu Lys Asn Leu Asp Glu Arg Gly He He Arg lie 

820 825 830 

ggg get gaa gtt cgt gac ggg gac ate eta gtt ggt aaa gtt aca ccc 
Gly Ala Glu Val Arg Asp Gly Asp He Leu Val Gly Lys Val Thr Pro 

835 840 845 

aaa ggg gtt agt gaa eta tea get gag gaa aaa etc etc cac get ate 
Lys Gly Val Ser Glu Leu Ser Ala Glu Glu Lys Leu Leu His Ala He 
850 855 860 

ttc ggc gaa aaa gee egg gaa gtt cgt gac acc tec etc cgt gtc cca 
Phe Gly Glu Lys Ala Arg Glu Val Arg Asp Thr Ser Leu Arg Val Pro 
865 870 875 

cac ggt agt ggc gga att gtc cac gat gtc cag ate ttt acc egg gaa 
His Gly Ser Gly Gly He Val His Asp Val Gin He Phe Thr Arg Glu 
880 885 890 895 

gec ggc gac gaa ctg tea cca ggc gtt aac tac ctt gtc cga gtt ttc 
Ala Gly Asp Glu Leu Ser Pro Gly Val Asn Tyr Leu Val Arg Val Phe 

900 905 910 

•att gec caa aaa cgt aaa att gac gtt ggg gac aag atg gca ggt cga 
He Ala Gin Lys Arg Lys He Asp Val Gly Asp Lys Met Ala Gly Arg 

915 920 925 

cac ggg aac aag ggt gtt gtt tec ctt ate tta cca gaa gaa gac atg 
His Gly Asn Lys Gly Val Val Ser Leu He Leu Pro Glu Glu Asp Met 
930 935 940 



2208 



2256 



2304 



2352 



2400 



2448 



2496 



2544 



2592 



2640 



2688 



2736 



2784 



2832 
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ccg ttt atg cca gac gga acc cca att gac ate atg etc aac cca ctt 
Pro Phe Met Pro Asp Gly Thr Pro lie Asp lie Met Leu Asn Pro Leu 
945 950 955 

ggt gtc cct tec egg atg aat gtc ggc cag gtc ate gaa etc cac atg 
Gly Val Pro Ser Arg Met Asn Val Gly Gin Val He Glu Leu His Met 
960 965 970 975 

ggg atg gca gee cga cag tta ggc gag cac att get act cca gtc ttt 
Gly Met Ala Ala Arg Gin Leu Gly Glu His He Ala Thr Pro Val Phe 

980 985 990 

gac ggg gec aac gaa gaa gat gtt tgg gaa act ate aag gaa gee ggt 
Asp Gly Ala Asn Glu Glu Asp Val Trp Glu Thr He Lys Glu Ala Gly 

995 1000 1005 

atg gat gca gat gec aaa acc gtc ttg tat gac ggc egg act ggc gag 
Met Asp Ala Asp Ala Lys Thr Val Leu Tyr Asp Gly Arg Thr Gly Glu 
1010 1015 1020 

cca ttt gac aac aag gtc tec gtt ggg gtg atg tac ttt ate aaa eta 
Pro Phe Asp Asn Lys Val Ser Val Gly Val Met Tyr Phe He Lys Leu 
1025 1030 1035 

gtc cac atg gtc gac gac aag ttg cac gee aga tec aca gga cca tac 
Val His Met Val Asp Asp Lys Leu His Ala Arg Ser Thr Gly Pro Tyr 
1040 1045 1050 1055 



2880 



caa cgc ttt ggt gag atg gaa gtc tgg gec ttg gaa get tat ggg get 
Gin Arg Phe Gly Glu Met Glu Val Trp Ala Leu Glu Ala Tyr Gly Ala 

1075 1080 1085 

tec cgc acc etc caa gaa ate ttg acc tac aag tea gat gac gtg att 
Ser Arg Thr Leu Gin Glu He Leu Thr Tyr Lys Ser Asp Asp Val He 
1090 1095 1100 

gga agg gta gac acc tat gaa gec att gtc aag ggc caa cgc att cca 
Gly Arg Val Asp Thr Tyr Glu Ala He Val Lys Gly Gin Arg He Pro 
1105 1110 1115 

aaa cct ggt gta cct gaa tec ttc cgt gtc etc gtg aaa gaa etc cag 
Lys Pro Gly Val Pro Glu Ser Phe Arg Val Leu Val Lys Glu Leu Gin 
1120 1125 1130 1135 



aat etc aag get gaa gat gac gag teg gaa gac caa gtc gtt gat tec 
Asn Leu Lys Ala Glu Asp Asp Glu Ser Glu Asp Gin Val Val Asp Ser 

1155 1160 1165 



2928 



2976 



3024 



3072 



3120 



3168 



tec ttg gtt acc caa caa ccc ctt ggt ggg aaa gca cag ttt ggt ggc 3216 
Ser Leu Val Thr Gin Gin Pro Leu Gly Gly Lys Ala Gin Phe Gly Gly 

1060 1065 1070 



3264 



3312 



3360 



3408 



tct ctg ggg ttg gac ctg aaa gtc etc gac aag gaa caa aac gaa ate 3456 
Ser Leu Gly Leu Asp Leu Lys Val Leu Asp Lys Glu Gin Asn Glu He 

1140 1145 1150 



3504 



eta gaa gaa atg cgt aaa gag cag gaa gaa gaa cgc cgt aag gaa aaa 



3552 
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Leu Glu Glu Met Arg Lys Glu Gin Glu Glu Glu Arg Arg Lys Glu Lys 
1170 1175 1180 

gaa aaa gaa gag cca agt act gag teg taa 3582 
Glu Lys Glu Glu Pro Ser Thr Glu Ser 
1185 1190 



<210> 14 
<211> 1192 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 14 

Met Asn Lys Leu Val Gly Lys Lys Val Asn Phe Gly Lys His Arg Val 
15 10 15 



Arg Arg Ser Tyr Ser Arg lie Asn Glu Val Leu Glu Leu Pro Asn Leu 

20 25 30 



He Glu He Gin Thr Asp Ser Tyr Asp Trp Phe Leu Asp Glu Gly Leu 
35 40 45 



Lys Glu Met Phe Ser Asp He Ser Pro He Asp Asp Phe Ser Gly Asn 
50 55 60 



Leu Ser Leu Glu Phe Val Asp Tyr Lys Phe Tyr Glu Ser Lys Tyr Thr 
65 70 75 80 



Val Glu Glu Ala Arg Glu His Asp Ala Asn Tyr Ser Ala Pro Leu Tyr 

85 90 95 



Val Lys Leu Arg Leu He Asn Lys Glu Thr Gly Glu Val Lys Glu Gin 

100 105 110 



Glu Val Phe Phe Gly Asp Phe Pro Leu Met Thr Glu Gin Gly Thr Phe 
115 120 125 



He He Asn Gly Ala Glu Arg Val He Val Ser Gin Leu Val Arg Ser 
130 135 140 



Pro Gly Val Tyr Tyr Ser Pro Lys Val Glu Lys Asn Gly Arg Glu Gly 
145 150 155 ' 160 



Phe Ser Thr Val Leu He Pro Asn Arg Gly Ala Trp Leu Glu Tyr Glu 

165 170 175 
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Thr Asp Thr Lys Gly lie Ser Asn Val Arg lie Asp Arg Thr Arg Lys 

180 185 190 



lie Pro lie Thr Val Leu Leu Arg Ala Leu Gly He Gly Ser Asp Asp 
195 200 205 



Glu He He Asp Leu He Gly Leu Asn Asp Ser Leu Glu Ala Thr Leu 
210 215 220 



Glu Lys Asp Val His Lys Ser Thr Ser Asp Ser Arg Val Glu Glu Ala 
225 230 235 240 



Leu Lys Asp Leu Tyr Glu Arg Leu Arg Pro Gly Glu Pro Lys Thr Ala 

245 250 255 



Glu Ser Ser Arg Asn Leu He Asn Thr Arg Phe Phe Asp His Lys Arg 

260 265 270 



Tyr Asp Leu Ala Tyr Val Gly Arg Tyr Lys Met Asn Lys Lys Leu Asp 
275 280 285 



Leu Lys Thr Arg Leu Met Gly Thr Val Leu Ala Glu Asn Leu Val Asp 
290 295 300 



Pro Glu Ala Gly Glu Val Leu Ala Glu Glu Gly Ser Glu Val Thr Arg 
305 310 315 320 



Ser Val Met Asp Lys Leu Gly Pro Tyr Leu Asp Gly Asp Met Asn Gin 

325 330 335 



Val Thr He Asn Pro Ser Glu Glu Ala Val He Pro Glu Pro He Asp 

340 345 350 



Leu Gin He Val Lys Val Tyr Ser Lys Glu Asp Pro Asp Arg He Val 
355 360 365 



Asn Met He Gly Asn Gly His Pro Asp Lys Lys Ala Lys Trp He Thr 
370 375 380 



Pro Ala Asp Met He Ala Ala Met Ser Tyr Phe Phe Asn Leu Gin Glu 
385 390 395 400 
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Gly He Gly Asp Val Asp Asp He Asp His Leu Gly Asn Arg Arg He 

405 410 415 



Arg Ser Val Gly Glu Leu Leu Gin Asn Gin Phe Arg He Gly Leu Ser 

420 425 430 



Arg Met Glu Arg Val Val Arg Glu Arg Met Ser He Gin Asp He Ser 
435 440 445 



Ser Thr Thr Pro Gin Gin Leu He Asn He Arg Pro Val Val Ala Ser 
450 455 460 



Leu Lys Glu Phe Phe Gly Ser Ser Gin Leu Ser Gin Phe Met Asp Gin 
465 470 475 480 



Thr Asn Pro Leu Gly Glu Leu Thr His Lys Arg Arg Leu Ser Ala Leu 

485 490 495 



Gly Pro Gly Gly Leu Thr Arg Asp Arg Ala Gly Tyr Glu Val Arg Asp 

500 505 510 



Val His Tyr Ser His Tyr Gly Arg Met Cys Pro He Glu Thr Pro Glu 
515 520 525 



Gly Pro Asn He Gly Leu He Asn Ser Leu Ser Thr Tyr Ala Lys He 
530 535 ^ 540 



Asn Lys Phe Gly Phe He Glu Thr Pro Tyr Arg Arg Val Asp Arg Glu 
545 550 555 560 



Thr Gly Gin Val Thr Asp Lys He Asp Tyr Leu Thr Ala Asp Glu Glu 

565 570 575 



Asp Leu Tyr Val Val Ala Gin Ala Asn Ala Glu Leu Asp Glu Asp Gly 

580 585 590 



His Phe Ala Asn Asp Val Val Leu Ala Arg Arg Arg Asp Val Asn Glu 
595 600 605 



Glu Val Asp Ala Ser Glu Val Asp Tyr Met Asp Val Ser Pro Lys Gin 
610 615 620 



Val Val Ser Val Ala Thr Ala Ser He Pro Phe Leu Glu Asn Asp Asp 
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625 630 635 640 



Ser Asn Arg Ala Leu Met Gly Ala Asn Met Gin Arg Gin Ala Val Pro 

645 650 655 



Leu Met Gin Pro Glu Ser Pro Leu Val Gly Thr Gly lie Glu His lie 

660 665 670 



Ala Ala Arg Asp Ser Gly Ala Ala Val He Ala Lys Ala Asp Gly Val 
675 680 685 



Val Glu Tyr Val Asp Ala Lys Thr Val Lys Val Arg Gin Ala Asp Gly 
690 695 700 



Thr Leu Asn Asn Tyr Lys Leu Ala Lys Tyr Lys Arg Ser Asn Ser Gly 
705 710 715 720 



Thr Ser Tyr Asn Gin Arg Pro He Val Lys Thr Gly Glu Glu Val Asp 

725 730 735 



Lys Gly Asp He Leu Ala Asp Gly Pro Ser Met Glu Asn Gly Glu Met 

740 745 750 



Ala Leu Gly Lys Asn .Pro Leu He Ala Phe Thr Thr Phe Asp Gly Tyr 
755 760 765 



Asn Phe Glu Asp Ala Val He Met Ser Glu Arg Leu Val Lys Asp Asp 
770 775 780 



Val Tyr Thr Ser He His He Glu Glu Tyr Glu Ser Glu Ala Arg Asp 
785 790 795 800 



Thr Lys Leu Gly Pro Glu Glu He Thr Arg Glu lie Pro Asn Val Gly 

805 810 815 



Glu Ser Ala Leu Lys Asn Leu Asp Glu Arg Gly He He Arg He Gly 

820 825 830 



Ala Glu Val Arg Asp Gly Asp He Leu Val Gly Lys Val Thr Pro Lys 
835 840 845 



Gly Val Ser Glu Leu Ser Ala Glu Glu Lys Leu Leu His Ala He Phe 
850 855 860 
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Gly Glu Lys Ala Arg Glu Val Arg Asp Thr Ser Leu Arg Val Pro His 
865 870 875 880 



Gly Ser Gly Gly lie Val His Asp Val Gin lie Phe Thr Arg Glu Ala 

885 890 895 



Gly Asp Glu Leu Ser Pro Gly Val Asn Tyr Leu Val Arg Val Phe lie 

900 905 910 



Ala Gin Lys Arg Lys lie Asp Val Gly Asp Lys Met Ala Gly Arg His 
915 920 925 



Gly Asn Lys Gly Val Val Ser Leu lie Leu Pro Glu Glu Asp Met Pro 
930 935 940 



Phe Met Pro Asp Gly Thr Pro lie Asp lie Met Leu Asn Pro Leu Gly 
945 950 955 960 



Val Pro Ser Arg Met Asn Val Gly Gin Val lie Glu Leu His Met Gly 

965 970 975 



Met Ala Ala Arg Gin Leu Gly Glu His lie Ala Thr Pro Val Phe Asp 

980 985 990 



Gly Ala Asn Glu Glu Asp Val Trp Glu Thr lie Lys Glu Ala Gly Met 
995 1000 1005 



Asp Ala Asp Ala Lys Thr Val Leu Tyr Asp Gly Arg Thr Gly Glu Pro 
1010 1015 1020 



Phe Asp Asn Lys Val Ser Val Gly Val Met Tyr Phe lie Lys Leu Val 
1025 1030 1035 1040 



His Met Val Asp Asp Lys Leu His Ala Arg Ser Thr Gly Pro Tyr Ser 

1045 1050 1055 



Leu Val Thr Gin Gin Pro Leu Gly Gly Lys Ala Gin Phe Gly Gly Gin 

1060 1065 1070 



Arg Phe Gly Glu Met Glu Val Trp Ala Leu Glu Ala Tyr Gly Ala Ser 
1075 1080 1085 
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Arg Thr Leu Gin Glu lie Leu Thr Tyr Lys Ser Asp Asp Val lie Gly 
1090 1095 1100 



Arg Val Asp Thr Tyr Glu Ala He Val Lys Gly Gin Arg He Pro Lys 
1105 1110 1115 1120 



Pro Gly Val Pro Glu Ser Phe Arg Val Leu Val Lys Glu Leu Gin Ser 

1125 1130 1135 



Leu Gly Leu Asp Leu Lys Val Leu Asp Lys Glu Gin Asn Glu He Asn 

1140 1145 1150 



Leu Lys Ala Glu Asp Asp Glu Ser Glu Asp Gin Val Val Asp Ser Leu 
1155 1160 1165 



Glu Glu Met Arg Lys Glu Gin Glu Glu Glu Arg Arg Lys Glu Lys Glu 
1170 1175 1180 



Lys Glu Glu Pro Ser Thr Glu Ser 
1185 1190 



<210> 15 
<211> 1407 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (25) . . (1407) 

<223> 

<400> 15 

aaagaccagg aaaggaagaa gacc ttg gca act aat att cat gaa gac cgc 

Met Ala Thr Asn He His Glu Asp Arg 
1 5 



51 



ctg cca cca caa aat att gaa gcg gag caa tec gtc tta ggg tec gtc 99 

Leu Pro Pro Gin Asn He Glu Ala Glu Gin Ser Val Leu Gly Ser Val 
10 15 20 25 

etc tta aat gca gaa gec ttg gtg gcg gec atg gaa tat gtg gat gaa 147 

Leu Leu Asn Ala Glu Ala Leu Val Ala Ala Met Glu Tyr Val Asp Glu 

30 35 40 

gat gac ttt tac egg egg gec cac cag ttg ate ttt aag gee atg ata 195 

Asp Asp Phe Tyr Arg Arg Ala His Gin Leu He Phe Lys Ala Met He 

45 50 55 



gac etc tat gaa gac aac cag gec att gat gtc att acc att aaa gac 



243 
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Asp Leu Tyr Glu Asd Asn Gin Ala lie Asp Val He Thr He Lys Asp 
60 " 65 70 

aag ctg gaa gcc aat gac cag ttg gag gat ate ggg ggt gec tct tac 
Lys Leu Glu Ala Asn Asp Gin Leu Glu Asp He Gly Gly Ala Ser Tyr 
75 80 85 

eta get gag att get ggg gtc ace cca ace gca get aac gtg tec tat 
Leu Ala Glu He Ala Gly Val Thr Pro Thr Ala Ala Asn Val Ser Tyr 
90 95 100 105 

tac get aag att gtg gaa gat egg tct ctt ttg cgc aac ttg att gcg 
Tyr Ala Lys He Val Glu Asp Arg Ser Leu Leu Arg Asn Leu He Ala 

110 115 120 

aca get aat gag att gcc cag tct ggc tac gaa gac cat gac gat gtg 
Thr Ala Asn Glu He Ala Gin Ser Gly Tyr Glu Asp His Asp Asp Val 

125 130 135 

cca gaa gtt tta aac aat get gag cag aag ate ttg cag gtt tct gaa 
"Pro. Glu Val Leu Asn Asn Ala Glu Gin Lys He Leu Gin Val Ser Glu 
140 * 145 150 



ace ate gag cat att gat gaa etc cac caa agg gat gaa gag ate ace 
Thr He Glu His He Asp Glu Leu His Gin Arg Asp Glu Glu He Thr 
170 175 180 185 

ggg att tea act ggc tac ccc tac ctg gac agg atg act tea ggc ctt 
Gly He Ser Thr Gly Tyr Pro Tyr Leu Asp Arg Met Thr Ser Gly Leu 

190 195 200 



acg get ttt gcc ttg aat gtc gcc caa aat ate ggg aca gcc aca gat 
Thr Ala Phe Ala Leu Asn Val Ala Gin Asn He Gly Thr Ala Thr Asp 
220 225 230 

gaa act att gcg att ttt tec ctt gag atg ggg get gaa cag ctg gtc 
Glu Thr He Ala lie Phe Ser Leu Glu Met Gly Ala Glu Gin Leu Val 
235 240 245 

aac egg atg tta tgt tea gaa ggc agt att gat gcc act aac etc cga 
Asn Arg Met Leu Cys Ser Glu Gly Ser He Asp Ala Thr Asn Leu Arg 
250 255 260 265 

* 

aat ggc aag eta acg ccg gaa gaa tat gac cgt ttg ttt gtg gcc atg 
Asn Gly Lys Leu Thr Pro Glu Glu Tyr Asp Arg Leu Phe Val Ala Met 

270 275 280 



291 



339 



387 



435 



483 



aaa cga aac egg ace ggc ttt get agt att tea gaa ate etc cac caa 531 
Lys Arg Asn Arg Thr Gly Phe Ala Ser He Ser Glu He Leu His Gin 
155 160 165 



579 



627 



cat gaa gat gag ttg att att gtc gca gca aga ccg ggt gtg ggg aaa 675 
His Glu Asp Glu Leu He He Val Ala Ala Arg Pro Gly Val Gly Lys 

205 210 215 



723 



771 



819 



867 



ggg age ttg tct gaa get gat att tat att gat gac act ccc ggc ate 915 
Gly Ser Leu Ser Glu Ala Asp He Tyr He Asp Asp Thr Pro Gly He 
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285 290 295 

egg aca get gaa ate egg gee aag tgc cgc cgc ctg gtc caa gag aag 
Arg Thr Ala Glu lie Arg Ala Lys Cys Arg Arg Leu Val Gin Glu.Lys 
300 305 310 

gga agt ctg ggc ttg att gtc att gac tac ctg caa ttg ate gaa gga 
Gly Ser Leu Gly Leu lie Val lie Asp Tyr Leu Gin Leu lie Glu Gly 
315 320 325 

get tea aac tat gaa tec aga cag cag cag gtg tct gat ata tct egg 
Ala Ser Asn Tyr Glu Ser Arg Gin Gin Gin Val Ser Asp He Ser Arg 
330 335 340 345 

cag ctg aag aag ctt tct aag gaa ctt tct gtc cca gtt att gee ctg 
Gin Leu Lys Lys Leu Ser Lys Glu Leu Ser Val Pro Val He Ala Leu 

350 355 360 

tea caa ctg tec egg agt gtg gaa cag aga caa gac aag egg ccc ate 
Ser Gin Leu Ser Arg Ser Val Glu Gin Arg Gin Asp Lys Arg Pro He 

365 370 375 

etc agt gac ttg egg gaa tea ggg teg att gaa cag gat gee gat att 
Leu Ser Asp Leu Arg Glu Ser Gly Ser He Glu Gin Asp Ala Asp He 
380 385 390 

gtg gee ttc ctt tac egg gag gac tac tac caa aat gaa gaa gat ate 
Val Ala Phe Leu Tyr Arg Glu Asp Tyr Tyr Gin Asn Glu Glu Asp He 
395 400 405 

gat gag gac ttt gtc gat aat age gtg gaa gtc att ate gaa aaa aac 
Asp Glu Asp Phe Val Asp Asn Ser Val Glu Val He He Glu Lys Asn 
410 415 420 425 

egg tea gga get cga gga aca gtc aag ttg aac ttt aag aaa gag ttc 
Arg Ser Gly Ala Arg Gly Thr Val Lys Leu Asn Phe Lys Lys Glu Phe 

430 435 440 

aac aaa ttt ace teg att tct tac egg tct gaa gat gaa- gtc cca gee 
Asn Lys Phe Thr Ser He Ser Tyr Arg Ser Glu Asp Glu Val Pro Ala 

445 450 455 

aac ttt ggc tag 
Asn Phe Gly 
460 



963 



1011 



1059 



1107 



1155 



1203 



1251 



1299 



1347 



1395 



1407 



<210> 16 
<211> 460 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 16 

Met Ala Thr Asn He His Glu Asp Arg Leu Pro Pro Gin Asn He Glu 
15 10 15 
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Ala Glu Gin Ser Val Leu Gly Ser Val Leu Leu Asn Ala Glu Ala Leu 

20 25 30 

Val Ala Ala Met Glu Tyr Val Asp Glu Asp Asp Phe Tyr Arg Arg Ala 
35 40 45 

His Gin Leu lie Phe Lys Ala Met lie Asp Leu Tyr Glu Asp Asn Gin 
50 55 60 

Ala He Asp Val lie Thr He Lys Asp Lys Leu Glu Ala Asn Asp Gin 
65 70 75 80 

Leu Glu Asp He Gly Gly Ala Ser Tyr Leu' Ala Glu He Ala Gly Val 

85 90 95 



Thr Pro Thr Ala Ala Asn Val Ser Tyr Tyr Ala Lys He Val Glu Asp 

100 105 HO 

Arg Ser Leu Leu Arg Asn Leu He Ala Tnr Ala Asn Glu He Ala Gin 
115 120 125 

Ser Gly Tyr Glu Asp His Asp Asp Val Pro Glu Val Leu Asn Asn Ala 
130 135 140 

Glu Gin Lys lie Leu Gin Val Ser Glu Lys Arg Asn Arg Thr Gly Phe 
145 150 155 160 

Ala Ser He Ser Glu He Leu His Gin Thr He Glu His He Asp Glu 

165 170 175 



Leu His Gin Arg Asp Glu Glu He Thr Gly He Ser Thr Gly Tyr Pro 

180 185 190 

Tyr Leu Asp Arg Met Thr Ser Gly Leu His Glu Asp Glu Leu He He 
195 200 205 

Val Ala Ala Arg Pro Gly Val Gly Lys Thr Ala Phe Ala Leu Asn Val 
210 215 220 

Ala Gin Asn He Gly Thr Ala Thr Asp Glu Thr He Ala He Phe Ser 
225 230 235 240 



Leu Glu Met Gly Ala Glu Gin Leu Val Asn Arg Met Leu Cys Ser Glu 
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245 



250 255 



Gly Ser lie Asp Ala Thr Asn Leu Arg Asn Gly Lys Leu Thr Pro Glu 

260 265 270 

Glu Tyr Asp Arg Leu Phe Val Ala Met Gly Ser Leu Ser Glu Ala Asp 
275 280 285 

lie Tyr lie Asp Asp Thr Pro Gly He Arg Thr Ala Glu He Arg Ala 
290 295 300 

Lys Cys Arg Arg Leu Val Gin Glu Lys Gly Ser Leu Gly Leu He Val 
305 310 315 320 

He Asp Tyr Leu Gin Leu He Glu Gly Ala Ser Asn Tyr Glu Ser Arg 

325 330 335 

Gin Gin Gin Val Ser Asp He Ser Arg Gin Leu Lys Lys Leu Ser Lys 

340 345 350 

Glu Leu Ser Val Pro Val He Ala Leu Ser Gin Leu Ser Arg Ser Val 
355 360 365 

Glu Gin Arg Gin Asp Lys Arg Pro He Leu Ser Asp Leu Arg Glu Ser 
370 375 380 

Gly Ser He Glu Gin Asp Ala Asp He Val Ala Phe Leu Tyr Arg Glu 
385 390 395 400 

Asp Tyr Tyr Gin Asn Glu Glu Asp He Asp Glu Asp Phe Val Asp Asn 

405 410 415 

Ser Val Glu Val He He Glu Lys Asn Arg Ser Gly Ala Arg Gly Thr 

420 425 430 

Val Lys Leu Asn Phe Lys Lys Glu Phe Asn Lys Phe Thr Ser He Ser 
435 440 445 



Tyr Arg Ser Glu Asp Glu Val Pro Ala Asn Phe Gly 
450 455 460 



<210> 17 
<211> 2484 
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<212> DNA 

<213> Alloiococcus otitidis 



<220> 

<221> CDS 

<222> (10) . . (2484) 

<223> 



ggg att get gtt ggg atg gca acc aat ate cca ccc cac aat ctt aag 
Gly lie Ala Val Gly Met Ala Thr Asn lie Pro Pro His Asn Leu Lys 
175 180 185 190 



99 



147 



195 



243 



291 



agga^gala ttg ttt ttg gaa gag aga gat age cgt tta gaa cag att aag 51 
Met Phe Leu Glu Glu Arg Asp Ser Arg Leu Glu Gin lie Lys 
1.5 1° 

ctg tec aag gag atg aaa aac tea ttc tta gac tat gec atg agt gtc 
Leu Ser Lys Glu Met Lys Asn Ser Phe Leu Asp Tyr Ala Met Ser Val 
15 20 25 30 

ate gtc tec egg gee eta ccc gat gtc egg gac ggc ttg aag ccg gtt 
lie Val Ser Arg Ala Leu Pro Asp Val Arg Asp Gly Leu Lys Pro Val 

35 40 45 

cac cga aga ate ctg tac gga atg aat gaa ctg ggc tta acc ccg gac 
His Arg Arg He Leu Tyr Gly Met Asn Glu Leu Gly Leu Thr Pro Asp 

50 55 60 

aag tct tat aaa aag tct gec cgt att gta ggg gat gtt atg ggg aaa 
Lys Ser Tyr Lys Lys Ser Ala Arg He Val Gly Asp Val Met Gly Lys 
65 70 75 

tac cac ccc cac ggt gac act get att tat gac tec atg gtc aga atg 
Tyr His Pro His Gly Asp Thr Ala He Tyr Asp Ser Met Val Arg Met 
80 85 90 

gee cag gac ttt tct tac cga gtt ccc tta gtg gac ggc cat ggg aac 
Ala Gin Asp Phe Ser Tyr Arg Val Pro Leu Val Asp Gly His Gly Asn 
95 100 105 HO 

ttt ggg teg gtt gac ggg gac gga get get gee atg egg tat acc gaa 
Phe Gly Ser Val Asp Gly Asp Gly Ala Ala Ala Met Arg Tyr Thr Glu 

115 120 125 

gee egg atg tec aag atg gee ttg gaa etc ctg cga gac ate aac aag 
Ala Arg Met Ser Lys Met Ala Leu Glu Leu Leu. Arg Asp He Asn Lys 

130 135 I 40 

gat acc att gac tac cac gat aac tat gat ggg act gag teg gaa ccc 
Asp Thr He Asp Tyr His Asp Asn Tyr Asp Gly Thr Glu Ser Glu Pro 
145 150 155 

gat ate ctt cct gec cgc ttc ccc aac etc tta gtc aac ggg get teg 531 
Asp He Leu Pro Ala Arg Phe Pro Asn Leu Leu Val Asn Gly Ala Ser 
160 165 170 



339 



387 



435 



483 



579 
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gaa gtc att gat gcc tgc gtc etc ttg atg gaa aat gag gat gtg act 
Glu Val lie Asp Ala Cys Val lieu Leu Met Glu Asn Glu Asp Val Thr 

135 200 205 



627 



gtg get gac ctt atg gaa gtc tta cca gga cct gac ttt ccg act ggg 675 
Val Ala Asp Leu Met Glu Val Leu Pro Gly Pro Asp Phe Pro Thr Gly 

210 215 220 

get tec ctt att ggt gtt tct ggc gtc cgc aag get tat gag acc ggt 723 
Ala Ser Leu lie Gly Val Ser Gly Val Arg Lys Ala Tyr Glu Thr Gly 
225 230 235 



cgt ggg tec att aaa tta egg gcc aag tec egg ate gat gtc gac caa 
Arg Gly Ser lie Lys Leu Arg Ala Lys Ser Arg He Asp Val Asp Gin 
240 245 250 

aaa ggt aag gaa aga att att ate gac gaa att cct tac atg gtc aac 
Lys Gly Lys Glu Arg He He He Asp Glu He Pro Tyr Met Val Asn 
255 260 265 270 

aag gcc aaa ttg gtc gaa aag att gcg gaa ctg get egg gac aag aaa 
Lys Ala Lys Leu Val Glu Lys He Ala Glu Leu Ala Arg Asp Lys Lys 

275 280 285 

att gac ggc att acc gat tta aat gat gag tct gac egg gaa ggc ttg 
He Asp Gly He Thr Asp Leu Asn Asp Glu Ser Asp Arg Glu Gly Leu 

290 295 300 

egg att gtg ate gat gta cgc egg gat act tct get ggt ata tta ctt 
Arg He Val He Asp Val Arg Arg Asp Thr Ser Ala Gly He Leu Leu 
305 310 315 

aac aag ctt tac aaa atg acc caa ttg cag gtt tct ttt ggc ttt aac 
Asn Lys Leu Tyr Lys Met Thr Gin Leu Gin Val Ser Phe Gly Phe Asn 
320 325 330 

atg ctg get ate gtc gat ggg gtg ccc aaa acc ttg ggc etc aaa gac 
Met Leu Ala He Val Asp Gly Val Pro Lys Thr Leu Gly Leu Lys Asp 
335 340 345 350 

ate ctg acc cac tac tta gac cat caa aaa act gtt ate cgc agg egg 
He Leu Thr His Tyr Leu Asp His Gin Lys Thr Val He Arg Arg Arg 

355 360 365 

aca gag ttt gac aag aac aag get gaa teg egg gcc cac ate tta gaa 
Thr Glu Phe Asp Lys Asn Lys Ala Glu Ser Arg Ala His He Leu Glu 

370 375 380 

ggg ctt egg act gcc tta gac cat ate gat gcc att att acc att ate 
Gly Leu Arg Thr Ala Leu Asp His He Asp Ala He He Thr He He 
385 390 395 



771 



819 



867 



915 



963 



1011 



1059 



1107 



1155 



1203 



cgt cag tec cag caa get gaa gaa gcc aaa agt caa ttg atg get tct 1251 
Arg Gin Ser Gin Gin Ala Glu Glu Ala Lys Ser Gin Leu Met Ala Ser 
400 405 410 



tat gac etc tct gac cgt caa gcc cag gcg att tta gac atg egg atg 



1299 
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Tyr Asp Leu Ser Asp Arg Gin Ala Gin Ala Xle Leu Asp Met Arg Met 
415 420 425 430 

gtc egg ttg act ggt ttg gaa aga gag aaa att gaa gat gaa tac get 
Val Arg Leu Thr Gly Leu Glu Arg Glu Lys He Glu Asp Glu Tyr Ala 

435 440 445 

gaa etc tta gaa aaa ate gag gac ttg cgt gac ate ttg gee egg cca 
Glu Leu Leu Glu Lys He Glu Asp . Leu Arg Asp He Leu Ala Arg Pro 

450 455 460 

gaa egg ate aag caa att ate aaa gaa gaa atg ate gaa att get gaa 
Glu Arg lie Lys Gin He He Lys Glu Glu Met He Glu He Ala Glu 
465 470 475 

aaa cac ggc caa gac cgc eta act gac ate egg gtt ggg gaa gag ttg 
Lys His Gly Gin Asp Arg Leu Thr Asp He Arg Val Gly Glu Glu Leu 
480 485 490 

age att gaa gac gaa gac ttg att gaa gaa gaa gat ate ate att ace 
Ser He Glu Asp Glu Asp Leu He Glu Glu Glu Asp He He He Thr 
495 500 505 510 

ctg tct cga aaa ggc tat ate aaa egg atg ccg get gga gaa ttc aag 
Leu Ser Arg Lys Gly Tyr He Lys Axg Met Pro Ala Gly Glu Phe Lys 

515 520 525 

gee caa aac cgc ggt ggc cgt ggg gtt aag ggg atg act ace aac gat 
Ala Gin- Asn Arg Gly Gly Arg Gly Val Lys Gly Met Thr Thr Asn Asp 

530 535 540 

ggg gac ttt gtt gaa cag ctg act ttc tgt tct agt cat gac caa ate 
Gly Asp Phe Val Glu Gin Leu Thr Phe Cys Ser Ser His Asp Gin He 
545 550 555 

etc ttc ttt ace aac caa ggc aag gtt tat aag ate aag gec tac gaa 
Leu Phe Phe Thr Asn Gin Gly Lys Val Tyr Lys He Lys Ala Tyr Glu 
560 565 570 

ate ccg gaa tat ggg cgt aat gee aag gga att cct gee ate aac ttt 
He Pro Glu Tyr Gly Arg Asn Ala Lys Gly He Pro Ala He Asn Phe 
575 580 585 590 

tta aat ata gat aaa gat gaa tat att caa gee atg gtc aac ttg act 
Leu Asn He Asp Lys Asp Glu Tyr He Gin Ala Met Val Asn Leu Thr 

595 600 605 



egg gtc aaa egg acg gee cag tct gaa ttt caa aat ate aga agt age 

Arg Val Lys Arg Thr Ala Gin Ser Glu Phe Gin Asn He Arg Ser Ser 
625 630 635 

ggg ttg aac gcg ate aat eta aat gaa ggc gat gaa ttg gtt aac gtg 

Gly Leu Asn Ala He Asn Leu Asn Glu Gly Asp Glu Leu Val Asn Val 



1347 



1395 



1443 



1491 



1539 



1587 



1635 



1683 



1731 



1779 



1827 



gac cag gca gat gac cag gac caa ttc ttc ttt gcg aca aga ctt ggc 1875 
Asp Gin Ala Asp Asp Gin Asp Gin Phe Phe Phe Ala Thr Arg Leu Gly 

610 615 620 



1923 



1971 
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640 645 650 

gtc cct acc cac aat gac cag gcc att ate ctg gec age cag caa ggc 
Val Pro Thr His Asn Asp Gin Ala He He Leu Ala Ser Gin Gin Gly 
655 660 665 670 

tat gcg gtc tac ttt gat gaa aaa gat ate cgt age atg ggt cga ggg 
Tyx Ala Val Tyr Phe Asp Glu Lys Asp He Arg Ser Met Gly Arg Gly 

675 680 685 

get gca ggt gtc cgt gga att cgc tta ggt gat ggc gac aca gtg gtt 
Ala Ala Gly Val Arg Gly He Arg Leu Gly Asp Gly Asp Thr Val Val 

690 695 700 

gcc atg gaa gtc tta gag ccg ggc caa gac gta tta gtc att act gaa 
Ala Met Glu Val Leu Glu Pro Gly Gin Asp Val Leu Val He Thr Glu 
705 710 715 

aaa ggg tac ggc aaa cga acc tec caa gaa gag tac acc etc cac aag 
Lys Gly Tyr Gly Lys Arg Thr Ser Gin Glu Glu Tyr Thr Leu His Lys 
720 725 730 



gac aat caa gtt aac caa acc gtt gag gaa taa 
Asp Asn Gin Val Asn Gin Thr Val Glu Glu 
815 820 



2019 



2067 



2115 



2163 



2211 



cga ggg ggc aag ggg gtt aaa acc ctt cat att acc gat aag aat ggt 2259 
Arg Gly Gly Lys Gly Val Lys Thr Leu His He Thr Asp Lys Asn Gly 
735 740 745 750 



2355 



ccc eta att gga ctg aaa act gtc tct ggt ggt gag gac gtc atg att 2307 
Pro Leu He Gly Leu Lys Thr Val Ser Gly Gly Glu Asp Val Met He 

755 760 765 

gtc acc gac caa ggt ate atg att cgt ate gaa gcc gac age ate tct 
Val Thr Asp Gin Gly He Met He Arg He Glu Ala Asp Ser He Ser 

770 775 780 

cag acc tec cgc eta acc caa ggt gtc cgt tta att cga ctt gaa gaa 
Gin Thr Ser Arg Leu Thr Gin Gly Val Arg Leu He Arg Leu Glu Glu 
785 790 795 

gat age egg gtg tea acg gta gcc etc att gat att gac caa gag ctt 2451 
Asp Ser Arg Val Ser Thr Val Ala Leu He Asp He Asp Gin Glu Leu 
800 805 810 



2403 



2484 



<210> 18 
<211> 824 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 18 

Met Phe Leu Glu Glu Arg Asp Ser Arg Leu Glu Gin He Lys Leu Ser 
15 10 15 
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Lys Glu Met Lys Asn Ser Phe Leu Asp Tyr Ala Met Ser Val He Val 

20 25 30 

Ser Arg Ala Leu Pro Asp Val Arg Asp Gly Leu Lys Pro Val His Arg 
35 40 45 

Arg He Leu Tyr Gly Met Asn Glu Leu Gly Leu Thr Pro Asp Lys Ser 
50 55 60 

Tyr Lys Lys Ser Ala Arg He Val Gly Asp Val Met Gly Lys Tyr His 
65 70 75 80 

Pro His Gly Asp Thr Ala He Tyr Asp Ser Met Val Arg Met Ala Gin 

85 90 95 

Asp Phe Ser Tyr Arg Val Pro Leu Val Asp Gly His Gly Asn Phe Gly 

100 105 HO 

Ser Val Asp Gly Asp Gly Ala Ala Ala Met Arg Tyr Thr Glu Ala Arg 
115 120 125 

Met Ser Lys Met Ala Leu Glu Leu Leu Arg Asp He Asn Lys Asp Thr 
130 135 140 

He Asp Tyr His Asp Asn Tyr Asp Gly Thr Glu Ser Glu Pro Asp He 
145 150 155 160 

Leu Pro Ala Arg Phe Pro Asn Leu Leu Val Asn Gly Ala Ser Gly He 

165 170 175 

Ala Val Gly Met Ala Thr Asn He Pro Pro His Asn Leu Lys Glu Val 

180 185 190 

He Asp Ala Cys Val Leu Leu Met Glu Asn Glu Asp Val Thr Val Ala 
195 200 205 

Asp Leu Met Glu Val Leu Pro Gly Pro Asp Phe Pro Thr Gly Ala Ser 
210 215 220 

Leu He Gly Val Ser Gly Val Arg Lys Ala Tyr Glu Thr Gly Arg Gly 
225 230 235 240 

Ser He Lys Leu Arg Ala Lys Ser Arg He Asp Val Asp Gin Lys Gly 
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Lys Glu Arg lie 

260 



Lys Leu Val Glu 
275 



Gly lie Thr Asp 
290 



Val He Asp Val 
305 



Leu Tyr Lys Met 



Ala lie Val Asp 

340 



Thr His Tyx Leu 
• 355 



Phe Asp Lys Asn 
370 



Arg Thr Ala Leu 
385 



Ser Gin Gin Ala 



Leu Ser Asp Arg 

420 



Leu Thr Gly Leu 
435 



Leu Glu Lys He 
450 



245 



He He Asp Glu 



Lys He Ala Glu 

280 



Leu Asn Asp Glu 
295 



Arg Arg Asp Thr 
310 



Thr Gin Leu Gin 
325 



Gly Val Pro Lys 



Asp His Gin Lys 

360 



Lys Ala Glu Ser 
375 



Asp His He Asp 
390 



Glu Glu Ala Lys 
405 



Gin Ala Gin Ala 



Glu Arg Glu Lys 

440 



Glu Asp Leu Arg 
455 



250 



He Pro Tyr Met 
265 



Leu Ala Arg Asp 



Ser Asp Arg Glu 

300 



Ser Ala Gly He 
315 



Val Ser Phe Gly 
330 



Thr Leu Gly Leu 
345 



Thr Val He Arg 



Arg Ala His He 

380 



Ala He He Thr 
395 



Ser Gin Leu Met 
410 



lie Leu Asp Met 

425 . 



He Glu Asp Glu 



Asp He Leu Ala 

460 



255 



Val Asn Lys Ala 
270 



Lys Lys He Asp 
285 



Gly Leu Arg He 



Leu Leu Asn Lys 

320 



Phe Asn Met Leu 
335 



Lys Asp He Leu 
350 



Arg Arg Thr Glu 
365 



Leu Glu Gly Leu 



He He Arg Gin 

400 



Ala Ser Tyr Asp 
415 



Arg Met Val Arg 
430 



Tyr Ala Glu Leu 
445 



Arg Pro Glu Arg 



He Lys Gin He He Lys Glu Glu Met He Glu He Ala Glu Lys His 
465 470 475 480 
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Gly Gin Asp Arg Leu Thr Asp lie Arg Val Gly Glu Glu Leu Ser He 

485 490 495 

Glu Asp Glu Asp Leu He Glu Glu Glu Asp He He He Thr Leu Ser 

500 505 510 

Arg Lys Gly Tyr He Lys Arg Met Pro Ala Gly Glu Phe Lys Ala Gin 
515 520 525 

Asn Arg Gly Gly Arg Gly Val Lys Gly Met Thr Thr Asn Asp Gly Asp 
530 535 540 

Phe Val Glu Gin Leu Thr Phe Cys Ser Ser His Asp Gin He Leu Phe 
545 550 555 560 

Phe Thr Asn Gin Gly Lys Val Tyr Lys He Lys Ala Tyr Glu He Pro 

565 570 575 

Glu Tyr Gly Arg Asn Ala Lys Gly He Pro Ala He Asn Phe Leu Asn 

580 585 590 

He Asp Lys Asp Glu Tyr He Gin Ala Met Val Asn Leu Thr Asp Gin 
595 600 605 

Ala Asp Asp Gin Asp Gin Phe Phe Phe Ala Thr Arg Leu Gly Arg Val 
610 615 620 

Lys Arg Thr Ala Gin Ser Glu Phe Gin Asn He Arg Ser Ser Gly Leu 
625 630 635 640 

Asn Ala He Asn Leu Asn Glu Gly Asp Glu Leu Val Asn Val Val Pro 

645 650 655 

Thr His Asn Asp Gin Ala He He Leu Ala Ser Gin Gin Gly Tyr Ala 

660 665 670 

Val Tyr Phe Asp Glu Lys Asp He Arg Ser Met Gly Arg Gly Ala Ala 
675 680 685 

Gly Val Arg Gly He Arg Leu Gly Asp Gly Asp Thr Val Val Ala Met 
690 695 700 
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Glu Val Leu Glu Pro Gly Gin Asp Val Leu Val lie Thr Glu Lys Gly 
705 710 715 720 

Tyr Gly Lys Arg Thr Ser Gin Glu Glu Tyr Thr Leu His Lys Arg Gly 

725 730 735 

Gly Lys Gly Val Lys Thr Leu His lie Thr Asp Lys Asn Gly Pro Leu 

740 745 750 

He Gly Leu Lys Thr Val Ser Gly Gly Glu Asp Val Met He Val Thr 
755 760 765 

Asp Gin Gly He Met He Arg He Glu Ala Asp Ser He Ser Gin Thr 
770 775 780 

Ser Arg Leu Thr Gin Gly Val Arg Leu He Arg Leu Glu Glu Asp Ser 
785 790 795 800 

Arg Val Ser Thr Val Ala Leu He Asp He Asp Gin Glu Leu Asp Asn 

805 810 815 



Gin Val Asn Gin Thr Val Glu Glu 

820 



<210> 19 
<211> 1956 
<212> UNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (7) . . (1956) 

<223> 

<400> 19 ' 

cgtgta atg get gaa gat aga cca tta aca cca aat gag tta gca gaa 

Met Ala Glu Asp Arg Pro Leu Thr Pro Asn Glu Leu Ala Glu 

1 5 10 

ctg aaa aaa aca tat gat get agt caa ate caa gtc tta gaa ggc eta 
Leu Lys Lys Thr Tyr Asp Ala Ser Gin He Gin Val Leu Glu Gly Leu 
15 20 25 30 

gaa gca gtc aga gta egg ccg ggt atg tac att ggg tec acc age aag 
Glu Ala Val Arg Val Arg Pro Gly Met Tyr He Gly Ser Thr Ser Lys 

35 40 45 

gaa ggc etc cac cac ttg gta tgg gag ate gtg gac aat get att gac 



48 



96 



144 



192 



WO 03/104391 



51/235 



PCT/US02/36122 



Glu Gly Leu His His Leu Val Trp Glu lie Val Asp Asn Ala He Asp 

50 55 60 

gaa get atg gec ggt tat gec gac aag att tct gtt tec att ttg gaa 
Glu Ala Met Ala Gly Tyr Ala Asp Lys He Ser Val Ser He Leu Glu 
65 70 75 

ggc gac gtg ate caa gtg get gat aac ggc egg ggc ate ccg gtt gat 
Gly Asp Val He Gin Val Ala Asp Asn Gly Arg Gly He Pro Val Asp 
80 85 90 

ate cag gaa aaa aca ggc egg cca get gtt gaa act gtc ttt aca gtc 
He Gin Glu Lys Thr Gly Arg Pro Ala Val Glu Thr Val Phe Thr Val 
95 100 105 110 

etc cac get ggt ggg aaa ttt ggt ggc ggt ggt tac aag gtt tec ggt 
Leu His Ala Gly Gly Lys Phe Gly Gly Gly Gly Tyr Lys Val Ser Gly 

115 120 125 

ggt ctg cac ggt gta ggg tct tct gtg gtc aat get etc tec gaa tac 
Gly Leu His Gly Val Gly Ser Ser Val Val Asn Ala Leu Ser Glu Tyr 

130 135 140 

etc caa gtc cag gtg cac cga gat ggt aaa ate tac caa caa gtt tac 
Leu Gin Val Gin Val His Arg Asp Gly Lys He Tyr Gin Gin Val Tyr 
145 150 155 

aag egg ggc ttg gtt gat tct gac ttg gaa gtg gtg ggt gag act gac 
Lys Arg Gly Leu Val Asp Ser Asp Leu Glu Val Val Gly Glu Thr Asp 
160 165 170 

cac act gga act att gtt acc ttt aag gca gat agt ttg att ttt aaa 
His Thr Gly Thr He Val Thr Phe Lys Ala Asp Ser Leu He Phe Lys 
175 180 185 190 

gac act act tct tat gac ttc aat acc tta gec acc egg ate egg gag 
Asp Thr Thr Ser Tyr Asp Phe Asn Thr Leu Ala Thr Arg He Arg Glu 

195 200 205 

ttg gec ttc tta aac cga ggc ttg aat att tec ate gaa gac aaa egg 
Leu Ala Phe Leu Asn Arg Gly Leu Asn He Ser He Glu Asp Lys Arg 

210 215 220 

caa gca ggc ggg cag tct ttg aac tac cac tat gaa ggt ggg ata teg 
Gin Ala Gly Gly Gin Ser Leu Asn Tyr His Tyr Glu Gly Gly He Ser 
225 230 235 

agt tat gtt gac cac ttg aat tec age cgt gaa gtt ctt tat gag acc 
Ser Tyr Val Asp His Leu Asn Ser Ser Arg Glu Val Leu Tyr Glu Thr 
240 245 250 

cca att ttc ttg gaa ggg gaa gaa gaa ggg att tct gtg gaa att gec 
Pro He Phe Leu Glu Gly Glu Glu Glu Gly He Ser Val Glu He Ala 
255 260 265 270 

etc cag cat acc gat age ttc cat act aat tta atg agt ttt gec aat 
Leu Gin His Thr Asp Ser Phe His Thr Asn Leu Met Ser Phe Ala Asn 



240 



288 



336 



3 84 



432 



480 



528 



576 



624 



672 
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275 



280 285 



aac ate cac acc tat gag ggt ggc atg cat att tec ggc ttc aag aca 
Asn lie His Thr Tyr Glu Gly Gly Met His He Ser Gly Phe Lys Tlir 

290 295 300 

gec ctt acc egg gcg gtc aac gac tat gee egg cag aat aac ttg etc 
Ala Leu Thr Arg Ala Val Asn Asp Tyr Ala Arg Gin Asn Asn Leu Leu 
305 310 315 

cga gag tea gag gat aac ttt acc ggc gat gac gtt egg gaa ggt ctg 
Arg Glu Ser Glu Asp Asn Phe Thr Gly Asp Asp Val Arg Glu Gly Leu 
320 325 330 

acg gtg gtt ttg tea ate aag cac cca gac ccc caa ttt gaa gga caa 
Thr Val Val Leu Ser He Lys His Pro Asp Pro Gin Phe Glu Gly Gin 
335 340 345 350 

acc aag act aag ctg ggg aac tct gaa gtc aga ggg ata att gac egg 
Thr Lys- Thr Lys Leu Gly Asn Ser Glu Val Arg Gly He He Asp Arg 

355 360 365 

etc ttt age cag cac ttt gaa cgt tac etc atg gaa aat cca aag gtt 
Leu Phe Ser Gin His Phe Glu Arg Tyr Leu Met Glu Asn Pro Lys Val 

370 375 380 

ggt aag egg att gtt gac aag gcg ctt ttg get tec aaa gec cgc caa 
Gly Lys Arg He Val Asp Lys Ala Leu Leu Ala Ser Lys Ala Arg Gin 
385 390 395 

gca gee aag aga gec egg gaa gtc acc egg aag aaa tea ggc tta gaa 
Ala Ala Lys Arg Ala Arg Glu Val Thr Arg Lys Lys Ser Gly Leu Glu 
400 405 410 

att age aac eta cca ggt aaa tta get gac tgt tct tec aaa gat cct 
He Ser Asn Leu Pro Gly Lys Leu Ala Asp Cys Ser Ser Lys Asp Pro 
415 420 425 430 

gaa gaa tec gaa etc ttt att gta gaa ggg gat tea get gga ggg teg 
Glu Glu Ser Glu Leu Phe He Val Glu Gly Asp Ser Ala Gly Gly Ser 

435 440 445 

get aag caa ggt egg tec egg gtt ttc cag get att ttg ccg att cgt 
Ala Lys Gin Gly Arg Ser Arg Val Phe Gin Ala He Leu Pro He Arg 

450 455 460 

ggt aag att ttg aat gtc gaa aaa gee age att gac cgt ate tta gec 
Gly Lys He Leu Asn Val Glu Lys Ala Ser He Asp Arg He Leu Ala 
465 470 475 

aat gaa gaa ate egg tct etc ttt aca gee atg gga act ggc ttc ggg 
Asn Glu Glu He Arg Ser Leu Phe Thr Ala Met Gly Thr Gly Phe Gly 
480 485 490 

gaa gaa ttt aat gtt gaa gaa get cgc tac aat aag tta att ate atg 
Glu Glu Phe Asn Val Glu Glu Ala Arg Tyr Asn Lys Leu He He Met 
495 500 505 510 
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aca gat get gat gtt gac gga gec cac att egg acc ttg etc ttg ace 
Thr Asp Ala Asp Val Asp Gly Ala His lie Arg Thr Leu Leu Leu Thr 

515 520 525 

ctt ctt tac egg tat atg egg ccc ttg att gaa gca ggt ttc gtc tac 
Leu Leu Tyr Arg Tyr Met Arg Pro Leu lie Glu Ala Gly Phe Val Tyr 

530 535 540 

att gec cag cca ccc etc tac cag gtc aag caa ggc aag aag gtt aaa 
lie Ala Gin Pro Pro Leu Tyr Gin Val Lys Gin Gly Lys Lys Val Lys 
545 550 555 

tac ttt gat agt gac egg gaa ctg gac tec tac ttg aaa gaa att cct 
Tyr Phe Asp Ser Asp Arg Glu Leu Asp Ser Tyr Leu Lys Glu lie Pro 
560 565 570 



1584 



gaa aat gee cgt tac gtg gaa aat ate gat ate tag 
Glu Asn Ala Arg Tyr Val Glu Asn lie Asp lie 
640 645 



1632 



1680 



1728 



gac tea ccc aag cct tct gtc caa cgc tac aaa ggc tta gga gaa atg 177 6 

Asp Ser Pro Lys Pro Ser Val Gin Arg Tyr Lys Gly Leu Gly Glu Met 
575 580 585 590 

gat get gag cag ttg tgg gaa acc acc atg aac cca gaa cac cgc cgc 1824 
Asp Ala Glu Gin Leu Trp Glu Thr Thr Met Asn Pro Glu His Arg Arg 

595 600 605 

tta ctt egg gta gac gta gac gac gec att gag get gac act att ttt 1872 
Leu Leu Arg Val Asp Val Asp Asp Ala He Glu Ala Asp Thr He Phe 

610 615 620 

gac atg ttg atg ggt gag gat gtc aaa ccc egg cgc gac ttt ate aaa 1920 
Asp Met Leu Met Gly Glu Asp Val Lys Pro Arg Arg Asp Phe He Lys 
625 630 635 



1956 



<210> 20 
<211> 649 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 20 

Met Ala Glu Asp Arg Pro Leu Thr Pro Asn Glu Leu Ala Glu Leu Lys 
1 5 10 15 

Lys Thr Tyr Asp Ala Ser Gin He Gin Val Leu Glu Gly Leu Glu Ala 

20 25 • 30 

Val Arg Val Arg Pro Gly Met Tyr He Gly Ser Thr Ser Lys Glu Gly 
35 40 45 



Leu His His Leu Val Trp Glu He Val Asp Asn Ala He Asp Glu Ala 
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50 



55 60 



Met Ala Gly Tyr Ala Asp Lys lie Ser Val Ser lie Leu Glu Gly Asp 
65 70 75 80 

Val lie Gin Val Ala Asp Asn Gly Arg Gly He Pro Val Asp He Gin 

85 90 95 

•s 

Glu Lys Thr Gly Arg Pro Ala Val Glu Thr Val Phe Thr Val Leu His 

100 105 HO 

Ala Gly Gly Lys Phe Gly Gly Gly Gly Tyr Lys Val Ser Gly Gly Leu 
115 120 125 

His Gly Val Gly Ser Ser Val Val Asn Ala Leu Ser Glu Tyr Leu Gin 
130 135 1^0 

Val Gin Val His Arg Asp Gly Lys He Tyr Gin Gin Val Tyr Lys Arg 
145 150 155 160 

Gly Leu Val Asp Ser Asp Leu Glu Val Val Gly Glu Thr Asp His Thr 

165 170 175 

Glv Thr He Val Thr Phe Lys Ala Asp Ser Leu He Phe Lys Asp Thr 

180 185 190 

Thr Ser Tyr Asp Phe Asn Thr Leu Ala Thr Arg He Arg Glu Leu Ala 
195 200 205 

Phe Leu Asn Arg Gly Leu Asn He Ser He Glu Asp Lys Arg Gin Ala 
210 215 220 

Gly Gly Gin Ser Leu Asn Tyr His Tyr Glu Gly Gly He Ser Ser Tyr 
225 230 235 240 

Val Asp His Leu Asn Ser Ser Arg Glu Val Leu Tyr Glu Thr Pro He 

245 250 255 

Phe Leu Glu Gly Glu Glu Glu Gly He Ser Val Glu He Ala Leu Gin 

260 265 270 

His Thr Asp Ser Phe His Thr Asn Leu Met Ser Phe Ala Asn Asn He 
275 280 285 
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His Thr Tyr Glu Gly Gly Met His He Ser Gly Phe Lys Thr Ala Leu 
290 295 300 

Thr Arg Ala Val Asn Asp Tyr Ala Arg Gin Asn Asn Leu Leu Arg Glu 
305 310 315 "° 

Ser Glu Asp Asn Phe Thr Gly Asp Asp Val Arg Glu Gly Leu Thr Val 

325 330 335 

Val Leu Ser He Lys His Pro Asp Pro Gin Phe Glu Gly Gin Thr Lys 

340 345 350 

Thr Lys Leu Gly Asn Ser Glu Val Arg Gly He He Asp Arg Leu Phe 
355 360 365 

Ser Gin His Phe Glu Arg Tyr Leu Met Glu Asn Pro Lys Val Gly Lys 
370 375 380 

Arg He Val Asp Lys Ala Leu Leu Ala Ser Lys Ala Arg Gin Ala Ala 
385 390 395 400 

Lys Arg Ala Arg Glu Val Thr Arg Lys Lys Ser Gly Leu Glu lie Ser 

405 410 415 

Asn Leu Pro Gly Lys Leu Ala Asp Cys Ser Ser Lys Asp Pro Glu Glu 

420 425 430 

Ser Glu Leu Phe He Val Glu Gly Asp Ser Ala Gly Gly Ser Ala Lys 
435 440 445 

Gin Gly Arg Ser Arg Val Phe Gin Ala He Leu Pro He Arg Gly Lys 
450 455 460 

He Leu Asn Val Glu Lys Ala Ser He Asp Arg He Leu Ala Asn Glu 
465 470 475 480 

Glu lie Arg Ser Leu Phe Thr Ala Met Gly Thr Gly Phe Gly Glu Glu 

485 490 495 

Phe Asn Val Glu Glu Ala Arg Tyr Asn Lys Leu He He Met Thr Asp 

500 505 510 
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Ala Asp Val Asp Gly Ala His He Arg Thr Leu Leu Leu Thr Leu Leu 
515 520 525 

Tyr Arg Tyr Met Arg Pro Leu He Glu Ala Gly Phe Val Tyr He Ala 
530 535 540 

Gin Pro Pro Leu Tyr Gin Val Lys Gin Gly Lys Lys Val Lys Tyr Phe 
545 550 555 560 

Asp Ser Asp Arg Glu Leu Asp Ser Tyr Leu Lys Glu He Pro Asp Ser 

565 570 575 

Pro Lys Pro Ser Val Gin Arg Tyr Lys Gly Leu Gly Glu Met Asp Ala 

580 585 590 

Glu Gin Leu Trp Glu Thr Thr Met Asn Pro Glu His Arg Arg Leu Leu 
595 600 605 

Arg Val Asp Val Asp Asp Ala He Glu Ala Asp Thr He Phe Asp Met 
610 615 620 

Leu Met Gly Glu Asp Val Lys Pro Arg Arg Asp Phe He Lys Glu Asn 
625 630 635 640 



Ala Arg Tyr Val Glu Asn He Asp He 

645 



<210> 21 
<211> 1218 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (16) . . (1218) 

<223> 

<400> 21 

agacctaatc atttt ttg aaa tgg aga aag aca aaa acc ate tat ggt aca 

Met Lys Trp Arg Lys Thr Lys Thr He Tyr Gly He 
1 5 10 

ctt aag aac aaa agg aag ttt gga ggg att ttt ttg aaa ttt tea gta 
Leu Lys Asn Lys Arg Lys Phe Gly Gly He Phe Leu Lys Phe Ser Val 
15 20 25 

aaa egg acg gaa ttt eta aaa gta tta aaa aaa gta cag att gca gtg 
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99 
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Lvs Arg Thr Glu Phe Leu Lys Val Leu Lys Lys Val Gin lie Ala Val 
30 35 40 

tct tct aaa agt acc ate get ate ttg ace ggg att aaa tta gaa gcg 
Ser Ser Lys Ser Thr lie Ala lie Leu Thr Gly He Lys Leu Glu Ala 
45 50 55 60 

gat aac cag ggt tta acc tta acc gga tct aac teg gat ate tea gtt 
Asp Asn Gin Gly Leu Thr Leu Thr Gly Ser Asn Ser Asp He Ser Val 

65 70 75 

gaa agt tac tta tct gtg acc gat gaa ggg gcg gat ttg gtt att gat 
Glu Ser Tyr Leu Ser Val Thr Asp Glu Gly Ala Asp Leu Val He Asp 

80 85 90 

gag ccg ggg cag att gtc ttg caa cca gec egg tta ttt gec aat ate 
Glu Pro Gly Gin He Val Leu Gin Pro Ala Arg Leu Phe Ala Asn He 
95 100 105 

gtc caa aaa tta ccg gac acc cac ttt aag gta aac gtt age caa ggc 
Val Gin Lys Leu Pro Asp Thr His Phe Lys Val Asn Val Ser Gin Gly 
HO 115 120 

cag caa acc caa ate acc tea get tea gee tec ttt act ate aac ggc 
Gin Gin Thr Gin He Thr Ser Ala Ser Ala Ser Phe Thr He Asn Gly 
125 130 135 140 

att gac gec atg tec tac ccc cac ttg cca gat ate gac ctg gag gaa 
He Asp Ala Met Ser Tyr Pro His Leu Pro Asp He Asp Leu Glu Glu 

145 150 155 

tec ttt acc ctg ccg gtt gac etc ttt aaa aac atg ate aac cag act 
Ser Phe Thr Leu Pro Val Asp Leu Phe Lys Asn Met He Asn Gin Thr 

160 165 170 

gtc ate gca gtc tec aac cat gaa agt egg ccc ate eta act ggg gtt 
Val He Ala Val Ser Asn His Glu Ser Arg Pro He Leu Thr Gly Val 
175 180 185 

aac eta tct etc aaa gag ggc cga etc aag gca gtg gca acc gac age 
Asn Leu Ser Leu Lys Glu Gly Arg Leu Lys Ala Val Ala Thr Asp Ser 
190 195 200 

cac cgt ttg teg caa egg tec ate caa tta gag tea gcg cct gat att 
His Arg Leu Ser Gin Arg Ser He Gin Leu Glu Ser Ala Pro Asp lie 
205 210 215 220 

tec ttt gac att gtg ata cca ggt aag tct ttg act gaa ctg act aag 
Ser Phe Asp He Val He Pro Gly Lys Ser Leu Thr Glu Leu Thr Lys 

225 230 235 

ttg atg gat gca gat gaa gaa gtc egg gta gec ate age gac aac caa 
Leu Met Asp Ala Asp Glu Glu Val Arg Val Ala He Ser Asp Asn Gin 

240 24 5 250 

ate eta ttt gee etc tec age age cag ttt tac tct egg etc eta gaa 
He Leu Phe Ala Leu Ser Ser Ser Gin Phe Tyr Ser Arg Leu Leu Glu 
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255 260 265 

ggt aag tat cct gat acc gac cgc ttg ate cca ggc gac acc cca acg 
Gly Lys Tyr Pro Asp Thr Asp Arg Leu lie Pro Gly Asp Thr Pro Thr 
270 275 280 



tec etc etc tec cat gaa ggg aaa aac aat gtg gtc caa etc aca gtg 
Ser Leu Leu Ser His Glu Gly Lys Asn Asn Val Val Gin Leu Thr Val 

305 310 315 



gtc caa gaa gaa att gac ttt ggc cac ttc caa ggc caa gac tta acc 
Val Gin Glu Glu lie Asp Phe Gly His Phe Gin Gly Gin Asp Leu Thr 
335 340 345 

att tct ttc aac ccc gac tac tta aaa gag gec ttg get acc ttt ggt 
He Ser Phe Asn Pro Asp Tyr Leu Lys Glu Ala Leu Ala Thr Phe Gly 
350 355 360 

caa gga gaa att aag ttg aaa tta gtt teg acc ttg cga ccc ttt gtc 
Gin Gly Glu He Lys Leu Lys Leu Val Ser Thr Leu Arg Pro Phe Val 
365 370 375 380 

ate gtc cca agt gag gac caa gga gac ttt ate caa ctt att act cca 
He Val Pro Ser Glu Asp Gin Gly Asp Phe He Gin Leu He Thr Pro 

385 390 395 

ate cga aca gee taa 
He Arg Thr Ala 

400 



<210> 22 
<211> 400 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 22 

Met Lys Trp Arg Lys Thr Lys Thr He Tyr Gly He Leu Lys Asn Lys 
1 5 10 15 

Arg Lys Phe Gly Gly He Phe Leu Lys Phe Ser Val Lys Arg Thr Glu 

20 25 30 



Phe Leu Lys Val Leu Lys Lys Val Gin He Ala Val Ser Ser Lys Ser 
35 40 45 



867 



gaa ate acc ttg gac acc aag gaa tta cag ggg get gtt gac egg get 915 
Glu He Thr Leu Asp Thr Lys Glu Leu Gin Gly Ala Val Asp Arg Ala 
285 290 295 300 



963 



act get gaa aag ttg gaa ate gaa ggc cag tea get gaa gtg ggc cat 1011 
Thr Ala Glu Lys Leu Glu He Glu Gly Gin Ser Ala Glu Val Gly His 

320 325 330 
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Thr lie Ala lie Leu Thr Gly He' Lys Leu Glu Ala Asp Asn Gin Gly 
50 55 60 

Leu Thr Leu Thr Gly Ser Asn Ser Asp He Ser Val Glu Ser Tyr Leu 
65 70 75 80 

Ser Val Thr Asp Glu Gly Ala Asp Leu Val He Asp Glu Pro Gly Gin 

85 90 95 

lie Val Leu Gin Pro Ala Arg Leu Phe Ala Asn He Val Gin Lys Leu 

100 105 HO 

Pro Asp Thr His Phe Lys Val Asn Val Ser Gin Gly Gin Gin Thr Gin 
115 120 125 

lie Thr Ser Ala Ser Ala Ser Phe Thr He Asn Gly He Asp Ala Met 
130 135 140 

Ser Tyr Pro His Leu Pro Asp He Asp Leu Glu Glu Ser Phe Thr Leu 
145 150 155 160 

Pro Val Asp Leu Phe Lys Asn Met He Asn Gin Thr Val He Ala Val 

165 170 175 

Ser Asn His Glu Ser Arg Pro He Leu Thr Gly Val Asn Leu Ser Leu 

180 185 190 

Lys Glu Gly Arg Leu Lys Ala Val Ala Thr Asp Ser His Arg Leu Ser 
195 200 205 

Gin Arg Ser He Gin Leu Glu Ser Ala Pro Asp He Ser Phe Asp He 
210 215 220 

Val He Pro Gly Lys Ser Leu Thr Glu Leu Thr Lys Leu Met Asp Ala 
225 230 235 240 

Asp Glu Glu Val Arg Val Ala He Ser Asp Asn Gin He Leu Phe Ala 

245 250 255 

Leu Ser Ser Ser Gin Phe Tyr Ser Arg Leu Leu Glu Gly Lys Tyr Pro 

260 265 270 

Asp Thr Asp Arg Leu He Pro Gly Asp Thr Pro Thr Glu He Thr Leu 
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275 



280 285 



Asp Thr Lys Glu Leu Gin Gly Ala Val Asp Arg Ala Ser Leu Leu Ser 
290 295 300 

His Glu Gly Lys Asn Asn Val Val Gin Leu Thr Val Thr Ala Glu Lys 
305 310 315 320 

Leu Glu He Glu Gly Gin Ser Ala Glu Val Gly His Val Gin Glu Glu 

325 330 335 

He Asp Phe Gly His Phe Gin Gly Gin Asp Leu Thr He Ser Phe Asn 

340 345 350 

Pro Asp Tyr Leu Lys Glu Ala Leu Ala Thr Phe Gly Gin Gly Glu He 
355 360 365 

Lys Leu Lys Leu Val Ser Thr Leu Arg Pro Phe Val He Val Pro Ser 
370 375 380 

Glu Asp Gin Gly Asp Phe He Gin Leu He Thr Pro He Arg Thr Ala 
385 390 395 400 



<210> 23 
<211> 1317 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (25) . . (1317) 

<223> 

<400> 23 = . 
tcaataactg cttttttagg agct ttg cag atg aat tgg aaa gaa acc ate 

Met Gin Met Asn Trp Lys Glu Thr lie 

1 5 

agt etc ate aac acc acc egg ggg acc gga gac aag aaa aat ttg aac 
Ser Leu He Asn Thr Thr Arg Gly Thr Gly Asp Lys Lys Asn Leu Asn 
10 15 20 25 

egg atg cga ctt tta etc aaa gag eta ggt aat cct gaa aca gac ttg 
Arg Met Arg Leu Leu Leu Lys Glu Leu Gly Asn Pro Glu Thr Asp Leu 

30 35 40 
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ccg gtc ate cac gtt get ggc acc aat ggc aaa ggg acg ace tgt get 
Pro val He His Val Ala Gly Thr Asn Gly Lys Gly Thr Thr Cys Ala 

45 50 55 

tat att gee cac age ttg gee cgt get ggt tat aaa aca gga ctt tac 
Tyr He Ala His Ser Leu Ala Arg Ala Gly Tyr Lys Thr Gly Leu Tyr 
60 65 70 

acc age ccc cac ctg gag egg gtc aat gaa egg ate egg att aat gac 
Thr Ser Pro His Leu Glu Arg Val Asn Glu Arg He Arg He Asn Asp 
75 80 85 

cgc tac ata tec gac caa gac tta atg get ttg acc ggt caa att gec 
Arg Tyr He Ser Asp Gin Asp Leu Met Ala Leu Thr Gly Gin He Ala 
90 95 100 105 

ccc ate att gac cat eta gaa gac tgc ttg ggt gag aaa tac tat tct 
Pro He He Asp His Leu Glu Asp Cys Leu Gly Glu Lys Tyr Tyr Ser 

110 115 120 

ttt gaa att tta act gee ctt gec ttc ttg tac ttc cag caa gca ggg 
Phe Glu He Leu Thr Ala Leu Ala Phe Leu Tyr Phe Gin Gin Ala Gly 

125 130 I 35 

gtg gac ttt tta gtt tta gaa act ggg gta ggg gga aaa att gat gcg 
Val Asp Phe Leu Val Leu Glu Thr Gly Val Gly Gly Lys He Asp Ala 
140 145 150 

acc aat gtg gtg ccc get cca ctg gtc tea gtc att ate tct att ggc 
Thr Asn Val Val Pro Ala Pro Leu Val Ser Val He He Ser He Gly 
155 160 165 

tat gac cac acc cat gtc ttg ggt aat acc ctg gaa gac att acc egg 
Tyr Asp His Thr His Val Leu Gly Asn Thr Leu Glu Asp He Thr Arg 
170 175 180 185 

cac aag gca ggg att att aag aaa ggc tgt ccg gtg gtg gtg ggc cct 
His Lys Ala Gly He He Lys Lys Gly Cys Pro Val Val Val Gly Pro 

190 195 200 

ctt gec gac cat tta ttg get att gtt aaa gag gtg tec aaa gaa atg 
Leu Ala Asp His Leu Leu Ala He Val Lys Glu Val Ser Lys Glu Met 

205 210 215 

gac agt aat tta acc att gtc cat ccc gac aag ttt gac att gtt cat 
Asp Ser Asn Leu Thr He Val His Pro Asp Lys Phe Asp He Val His 
220 225 230 

caa acc ctt gac tac cag tec ttt aaa tac ggt ggg gac ttg gtt tta 
Gin Thr Leu Asp Tyr Gin Ser Phe Lys Tyr Gly Gly Asp Leu Val Leu 
235 240 245 

gag act caa atg att ggt aac cac cag ctg gta aac act gec eta get 
Glu Thr Gin Met He Gly Asn His Gin Leu Val Asn Thr Ala Leu Ala 
250 255 260 265 

tat gaa gec ttg aag att gtc caa caa tct tac ccc gat ttg aca gat 
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243 



291 



339 
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Tyr Glu Ala Leu Lys lie Val Gin Gin Ser Tyr Pro Asp Leu Thr Asp 

270 275 280 

tta gat ata tta gaa ggc ttg aag acg acc cac tgg cca ggc egg atg 
Leu Asp He Leu Glu Gly Leu Lys Thr Thr His Trp Pro Gly Arg Met 

285 290 295 

caa aag eta tct gac cag cca gtg gtt gtt ctt gat ggg gee cac aac 
Gin Lys Leu Ser Asp Gin Pro Val Val Val Leu Asp Gly Ala His Asn 
300 305 310 

gaa ate ggg gtc aag get ctt aga cag tea att gac cac ttt ttc ccc 
Glu lie Gly Val Lys Ala Leu Arg Gin Ser He Asp His Phe Phe Pro 
315 320 325 

* 

ggc aaa aaa ate acc tat ttt gec gga atg atg gtc gaa aaa gac ttc 
Gly Lys Lys He Thr Tyr Phe Ala Gly Met Met Val Glu Lys Asp Phe 
330 335 340 345 

gec aaa atg ttt gac etc ctg ggg gaa aca get gat aaa ttt tac ttg 
Ala Lys Met Phe Asp Leu Leu Gly Glu Thr Ala Asp Lys Phe Tyr Leu 

350 355 360 

att tea ccc gat ttg act cgc ggt ttt gat gtc gac caa gec gtt caa 
He Ser Pro Asp Leu Thr Arg Gly Phe Asp Val Asp Gin Ala Val Gin 

365 370 375 

tct ttg act gac aag ggc tac cag get tec agt gtg get age etc caa 
Ser Leu Thr Asp Lys Gly Tyr Gin Ala Ser Ser Val Ala Ser Leu Gin 
380 385 390 

gee ate tta gac tac ata aac cag caa gca aaa gca gat gaa att ate 
Ala He Leu Asp Tyr He Asn Gin Gin Ala Lys Ala Asp Glu He He 
395 400 405 

att ate ttt ggc tec etc tac ttg gtt ggc gac ttc eta aaa ctt tac 
He He Phe Gly Ser Leu Tyr Leu Val Gly Asp Phe Leu Lys Leu Tyr 
410 415 420 425 

cat gaa gca tec ggt taa 
His Glu Ala Ser Gly 

430 



915 



963 



1011 



1059 



1107 



1155 



1203 



1251 



1299 



1317 



<210> 24 
<211> 430 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 24 

Met Gin Met Asn Trp Lys Glu Thr He Ser Leu He Asn Thr Thr Arg 
15 10 15 



Gly Thr Gly Asp Lys Lys Asn Leu Asn Arg Met Arg Leu Leu Leu Lys 

20 25 30 
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Glu Leu Gly Asn Pro Glu Thr Asp Leu Pro Val lie His Val Ala Gly 
35 40 45 

Thr Asn Gly Lys Gly Thr Thr Cys Ala Tyr lie Ala His Ser Leu Ala 
50 55 60 

Arg Ala Gly Tyr Lys Thr Gly Leu Tyr Thr Ser Pro His Leu Glu Arg 
65 70 75 80 

Val Asn Glu Arg He Arg He Asn Asp Arg Tyr He Ser Asp Gin Asp 

85 90 95 

Leu Met Ala Leu Thr Gly Gin He Ala Pro He He Asp His Leu Glu 

100 105 HO 

Asp Cys Leu Gly Glu Lys Tyr Tyr Ser Phe Glu He Leu Thr Ala Leu 
115 120 125 

Ala Phe Leu Tyr Phe Gin Gin Ala Gly Val Asp Phe Leu Val Leu Glu 
130 135 140 

Thr Gly Val Gly Gly Lys He Asp Ala Thr Asn Val Val Pro Ala Pro 
145 150 155 160 

Leu Val Ser Val He He Ser He Gly Tyr Asp His Thr His Val Leu 

165 170 175 

Gly Asn Thr Leu Glu Asp He Thr Arg His Lys Ala Gly He He Lys 

180 185 190 

Lys Gly Cys Pro Val Val Val Gly Pro Leu Ala Asp His Leu Leu Ala 
195 200 205 

He Val Lys Glu Val Ser Lys Glu Met Asp Ser Asn Leu Thr He Val 
210 215 ~ 220 

His Pro Asp Lys Phe Asp He Val His Gin Thr Leu Asp Tyr Gin Ser 
225 230 235 240 

Phe Lys Tyr Gly Gly Asp Leu Val Leu Glu Thr Gin Met He Gly Asn 

245 250 255 
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His Gin Leu Val Asn Thr Ala Leu Ala Tyx Glu Ala Leu Lys lie Val 

260 265 270 



Gin Gin Ser Tyr Pro Asp Leu Thr Asp Leu Asp lie Leu Glu Gly Leu 
275 280 285 

Lys Thr Thr His Trp Pro Gly Arg Met Gin Lys Leu Ser Asp Gin Pro 
290 295 300 

Val Val Val Leu Asp Gly Ala His Asn Glu lie Gly Val Lys Ala Leu 
305 310 315 320 

Arg Gin Ser He Asp His Phe Phe Pro Gly Lys Lys He Thr Tyr Phe 

325 330 335 



Ala Gly Met Met Val Glu Lys Asp Phe Ala Lys Met Phe Asp Leu Leu 

340 345 350 

Gly Glu Thr Ala Asp Lys Phe Tyr Leu He Ser Pro Asp Leu Thr Arg 
355 360 365 

Gly Phe Asp Val Asp Gin Ala Val Gin Ser Leu Thr Asp Lys Gly Tyr 
370 375 380 



Gin Ala Ser Ser Val Ala Ser Leu Gin Ala He Leu Asp Tyr He Asn 
385 390 395 400 



Gin Gin Ala Lys Ala Asp Glu He He He He Phe Gly Ser Leu Tyr 

405 410 415 



Leu Val Gly Asp Phe Leu Lys Leu Tyr His Glu Ala Ser Gly 

420 425 430 



<210> 25 
<211> 1653 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (91) . . (1653) 

<223> 

<400> 25 

cttcttgttt catttttaat cttatctgaa acaaatgatt tttcaactct tttttatctt 



60 
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actttaaaag ttttagttag gagccttagc ttg tac cgt ate tct atg aaa gac 114 

Met Tyr Arg lie Ser Met Lys Asp. 
1 5 

ttg cat gec eta tta get age aag cag cag ttg aaa gaa gtg gtc ggt 162 
Leu His Ala Leu Leu Ala Ser Lys Gin Gin Leu Lys Glu Val Val Gly 
10 15 20 



ccc gac caa gtt tgg cat tac aat ttg cct caa ggg gaa ttg gee gac 
Pro Asp Gin Val Trp His Tyr Asn Leu Pro Gin Gly Glu Leu Ala Asp 
25 30 35 40 

caa gtt ttt gac aaa ctt tec tac aat tec caa act gec tec tea gac 
Gin Val Phe Asp Lys Leu Ser Tyr Asn Ser Gin Thr Ala Ser Ser Asp 

45 50 55 

acc ctt ttc ttt tgc aag ggt get tec ttt aaa aga gac tac eta gee 
Thr Leu Phe Phe Cys Lys Gly Ala Ser Phe Lys Arg Asp Tyr Leu Ala 

60 65 70 



gee tac aag atg gac egg gtc tat gga ctg act ttc gac ttt gga gee 
Ala Tyr Lys Met Asp Arg Val Tyr Gly Leu Thr Phe Asp Phe Gly Ala 

205 210 215 



210 



258 



306 



450 



498 



cag gcg gtt gac cag ggt gtc caa gtc tat att tec gaa aaa ttg tat 354 
Gin Ala Val Asp Gin Gly Val Gin Val Tyr lie Ser Glu Lys Leu Tyr 
75 80 85 

caa ggc ctg gat get tat gec ate att gtc cgt gac ate cgc cag acc 402 
Gin Gly Leu Asp Ala Tyr Ala lie lie Val Arg Asp lie Arg Gin Thr 
90 95 100 

atg gec eta gtc get aag get ttt tac cag get cca gat gaa aaa ttg 
Met Ala Leu Val Ala Lys Ala Phe Tyr Gin Ala Pro Asp Glu Lys Leu 
105 110 115 120 

acc ctg att ggc att acc ggg acc aag ggc aag aca acc aca agt tac 
Thr Leu Xle Gly lie Thr Gly Thr Lys Gly Lys Thr Thr Thr Ser Tyr 

125 130 135 

etc etc aaa tec ate ctg gac cag gac caa gee ggt aag aca get att 
Leu Leu Lys Ser lie Leu Asp Gin Asp Gin Ala Gly Lys Thr Ala He 

140 145 150 

att tea acc ttg ggg att tec tta gac ggc cag acc caa gaa gaa gee 
He Ser Thr Leu Gly He Ser Leu Asp Gly Gin Thr Gin Glu Glu Ala 
155 160 165 

tec ctg acc act cct gaa gee ttg gac etc tac cag atg att gec egg 
Ser Leu Thr Thr Pro Glu Ala Leu Asp Leu Tyr Gin Met He Ala Arg 
170 175 180 

gee caa gac cag ggg atg gac caa ttg att atg gaa gta tct age caa 690 
Ala Gin Asp Gin Gly Met Asp Gin Leu He Met Glu Val Ser Ser Gin 
185 190 195 200 



546 



594 



642 



738 



WO 03/104391 



66/235 



PCT/US02/36122 



ttc tta aat att teg cct gac cat ate ggc cct aat gag cac cca gat 
Phe Leu Asn lie Ser Pro Asp His He Gly Pro Asn Glu His Pro Asp 

220 225 230 

atg gaa gat tac ttc tat tgt aaa agt cgt ttg gtt aaa cat tec aag 
Met Glu Asp Tyr Phe Tyr Cys Lys Ser Arg Leu Val Lys His Ser Lys 
235 240 245 

ttg gec ttg etc aac get gga ctt gac cag eta gac tac tta aaa gac 
Leu Ala Leu Leu Asn Ala Gly Leu Asp Gin Leu Asp Tyr Leu Lys Asp 
250 255 260 

ctt age caa aaa aat ggc ggt cag gtc caa gtt tac ggc caa gat ccc 
Leu Ser Gin Lys Asn Gly Gly Gin Val Gin Val Tyr Gly Gin Asp Pro 
265 270 275 280 

aag act tgt gac tac tat ttt gag gtt aac aac cag gac age cgc cgc 
Lys Thr Cys Asp Tyr Tyr Phe Glu Val Asn Asn Gin Asp Ser Arg Arg 

285 290 295 

ttt gec att aaa age caa age cct gat gac ttg gee att gat ggg gat 
Phe Ala He Lys Ser Gin Ser Pro Asp Asp Leu Ala He Asp Gly Asp 

300 305 310 

tac caa ttt gaa atg ttg ggt gat ttt aac aag gag aat gee ctt tgt 
Tyr Gin Phe Glu Met Leu Gly Asp Phe Asn Lys Glu Asn Ala Leu Cys 
315 320 325 

gee get ctt ata gcg ggg cat tta gaa gtt ggg caa gag gec att tac 
Ala Ala Leu He Ala Gly His Leu Glu Val Gly Gin Glu Ala He Tyr 
330 335 340 

caa gga ata gee cag gee caa gtg cca gga egg atg cag cat tat act 
Gin Gly He Ala Gin Ala Gin Val Pro Gly Arg Met Gin His Tyr Thr 
345 350 355 360 

tat ggc aac aat cac ate tat gta gac ttt gee cac aat tac ate age 
Tyr Gly Asn Asn His He Tyr Val Asp Phe Ala His Asn Tyr He Ser 

365 370 375 

ttg aaa aat ctt ttt gat ttt gec caa gac caa cac ccg gac cac ace 
Leu Lys Asn Leu Phe Asp Phe Ala Gin Asp Gin His Pro Asp His Thr 

380 385 390 



786 



aag gat atg gga tac ttg ctg tec caa tac caa ggg gaa gtt ate ttg 
Lys Asp Met Gly Tyr Leu Leu Ser Gin Tyr Gin Gly Glu Val He Leu 
410 415 420 

acc gaa gat gac ccc aat ttt gaa gac gtt caa get ate tgc caa gaa 
Thr Glu Asp Asp Pro Asn Phe Glu Asp Val Gin Ala He Cys Gin Glu 
425 430 435 440 



834 



882 



930 



978 



1026 



1074 



1122 



1170 



1218 



1266 



atg gtg gtt gtc ttg ggg gee cct ggc aac aag ggg gtg tct cgc cgc 1314 
Met Val Val Val Leu Gly Ala Pro Gly Asn Lys Gly Val Ser Arg Arg 
395 400 405 



1362 



1410 
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att gcc caa tac att gat ggc ccc ate cag gtg acc ttt aat gat aac 
lie Ala Gin Tyr lie Asp Gly Pro lie Gin Val Thr Phe Asn Asp Asn 

445 450 455 

egg ata aat gcc ate caa gac etc eta gag tec tta acc cca gaa agt 
Arg He Asn Ala He Gin Asp Leu Leu Glu Ser Leu Thr Pro Glu Ser 

460 465 470 



1458 



egg egg ggt gtg aag gaa gat tat gcg gga gac cac aaa ttg gtt gaa 
Arg Arg Gly Val Lys Glu Asp Tyr Ala Gly Asp His Lys Leu Val Glu 
490 495 500 

gca ttt tta aac cag caa aag act tct tct cat gag aag ctt gag ggt 
Ala Phe Leu Asn Gin Gin Lys Thr Ser Ser His Glu Lys Leu Glu Gly 
505 510 515 520 

tag 



<210> 26 
<211> 520 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 26 

Met Tyr Arg He Ser Met Lys Asp Leu His Ala Leu Leu Ala Ser Lys 
15 10 15 



Gin Gin Leu Lys Glu Val Val Gly Pro Asp Gin Val Trp His Tyr Asn 

20 25 30 



Leu Pro Gin Gly Glu Leu Ala Asp Gin Val Phe Asp Lys Leu Ser Tyr 
35 40 45 

Asn Ser Gin Thr Ala Ser Ser Asp Thr Leu Phe Phe Cys Lys Gly Ala 
50 55 60 

Ser Phe Lys Arg Asp Tyr Leu Ala Gin Ala Val Asp Gin Gly Val Gin 
65 70 75 80 

Val Tyr He Ser Glu Lys Leu Tyr Gin Gly Leu Asp Ala Tyr Ala He 

85 90 95 



1506 



caa aaa gtc ate ctg ctt gca ggc aag ggg tec gac cag tac atg ctg 1554 
Gin Lys Val He Leu Leu Ala Gly Lys Gly Ser Asp Gin Tyr Met Leu 
475 480 485 



1602 



1650 



1653 



He Val Arg Asp He Arg Gin Thr Met Ala Leu Val Ala Lys Ala Phe 

100 105 110 
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Tyr Gin Ala Pro Asp Glu Lys Leu Thr Leu lie Gly lie Thr Gly Thr 
115 120 125 



Lys Gly Lys Thr Thr Thr Ser Tyr Leu Leu Lys Ser lie Leu Asp Gin 
130 135 140 



Asp Gin Ala Gly Lys Thx Ala lie He Ser Thx Leu Gly He Ser Leu 
145 150 155 160 



Asp Gly Gin Thr Gin Glu Glu Ala Ser Leu Thx Thr Pro Glu Ala Leu 

165 170 175 



Asp Leu Tyr Gin Met He Ala Arg Ala Gin Asp Gin Gly Met Asp Gin 

180 185 190 



Leu He Met Glu Val Ser Ser Gin Ala Tyr Lys Met Asp Arg Val Tyr 
195 200 205 



Gly Leu Thr Phe Asp Phe Gly Ala Phe Leu Asn He Ser Pro Asp His 
210 215 220 



He Gly Pro Asn Glu His Pro Asp Met Glu Asp Tyr Phe Tyr Cys Lys 
225 230 235 240 



Ser Arg Leu Val Lys His Ser Lys Leu Ala Leu Leu Asn Ala Gly Leu 

245 250 255 



Asp Gin Leu Asp Tyr Leu Lys Asp Leu Ser Gin Lys Asn Gly Gly Gin 

260 265 270 



Val Gin Val Tyr Gly Gin Asp Pro Lys Thr Cys Asp Tyr Tyr Phe Glu 
275 280 285 



Val Asn Asn Gin Asp Ser Arg Arg Phe Ala He Lys Ser Gin Ser Pro 
290 295 300 



Asp Asp Leu Ala He Asp Gly Asp Tyr Gin Phe Glu Met Leu Gly Asp 
305 310 315 320 



Phe Asn Lys Glu Asn Ala Leu Cys Ala Ala Leu He Ala Gly His Leu 

325 330 335 



Glu Val Gly Gin Glu Ala He Tyr Gin Gly He Ala Gin Ala Gin Val 
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340 345 350 



Pro Gly Arg Met Gin His Tyr Thr Tyr Gly Asn Asn His lie Tyr Val 
355 360 365 



Asp Phe Ala His Asn Tyr lie Ser Leu Lys Asn Leu Phe Asp Phe Ala 
370 375 380 



Gin Asp Gin His Pro Asp His Thr Met Val Val Val Leu Gly Ala Pro 
385 390 395 400 



Gly Asn Lys Gly Val Ser Arg Arg Lys Asp Met Gly Tyr Leu Leu Ser 

405 410 415 



Gin Tyr Gin Gly Glu Val lie Leu Thr Glu Asp Asp Pro Asn Phe Glu 

420 425 430 



Asp Val Gin Ala lie Cys Gin Glu lie Ala Gin Tyr lie Asp Gly Pro 
435 440 445 



XI e Gin Val Thr Phe Asn Asp Asn Arg lie Asn Ala lie Gin Asp Leu 
450 455 460 



Leu Glu Ser Leu Thr Pro Glu Ser Gin Lys Val lie Leu Leu Ala Gly 
465 470 475 480 



Lys Gly Ser Asp Gin Tyr Met Leu Arg Arg Gly Val Lys Glu Asp Tyr 

485 490 495 



Ala Gly Asp His Lys Leu Val Glu Ala Phe Leu Asn Gin Gin Lys Thr 

500 505 510 



Ser Ser His Glu Lys Leu Glu Gly 
515 520 



<210> 27 
<211> 636 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222* (25) . . (636) 

<223> 
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<400> 27 

aggactgaaa ggaaggttgg agtc atg aag tgg ttg agt egg ate ttg att 51 

Met Lys Trp Leu Ser Axg lie Leu lie 
1 5 



gtt gtt ggg ata ggc ttt ctg att gec ttt ggc tac acg att tat gac 
Val Val Gly He Gly Phe Leu lie Ala Phe Gly Tyx Thr He Tyr Asp 
10 15 20 25 

cat get aac teg aca teg gtt acc eta gaa gaa gee cag gtg gee ctg 
His Ala Asn Ser Thr Ser Val Thr Leu Glu Glu Ala Gin Val Ala Leu 

30 35 40 



ggc caa gat ggg gcg agt gac ate gat ate caa aac tac cag cct gaa 
Gly Gin Asp Gly Ala Ser Asp He Asp He Gin Asn Tyr Gin Pro Glu 
60 65 70 

get ggg gag get ttt ggg gtc tta gat att ccc aaa etc gac egg age 
Ala Gly Glu Ala Phe Gly Val Leu Asp He Pro Lys Leu Asp Arg Ser 
'75 80 85 

att ggc att gta gec gga acg gat gca gac tct ctt aag aag ggg gta 
He Gly He Val Ala Gly Thr Asp Ala Asp Ser Leu Lys Lys Gly Val 
90 95 100 105 

ggt cac gtt gag aat aca gtc ttc cct ggc caa ggc gaa caa att gtc 
Gly His Val Glu Asn Thr Val Phe Pro Gly Gin Gly Glu Gin He Val 

110 115 120 



att ggc gac aat ttt ate gtt caa atg cct tac ggg gac tat gaa tat 
He Gly Asp Asn Phe He Val Gin Met Pro Tyr Gly Asp Tyr Glu Tyr 
140 145 150 



egg cct atg ggg gaa gaa gtc tta gtg gtt tea acc tgc tac ccc ttt 
Arg Pro Met Gly Glu Glu Val Leu Val Val Ser Thr Cys Tyr Pro Phe 
170 175 180 185 

gaa ttt tac ggt ttt gec cct gac cgc ttt gtt ttc tat tgt tac ccc 
Glu Phe Tyr Gly Phe Ala Pro Asp Arg Phe Val Phe Tyr Cys Tyr Pro 

190 195 200 

gtt gaa taa 
Val Glu 



99 



147 



gaa gaa age egg gee cag get get gaa get ggg gac ggg gac cag gat 195 
Glu Glu Ser Arg Ala Gin Ala Ala Glu Ala Gly Asp Gly Asp Gin Asp 

45 50 55 



243 



291 



339 



387 



etc tct ggc cac egg gat acc gtc ttc egg gac ttt ggc gaa tta gaa 435 
Leu Ser Gly His Arg Asp Thr Val Phe Arg Asp Phe Gly Glu Leu Glu 

125 130 135 



483 



gag att cag gac tat gaa att gtc gac egg gat gat acc tec gtc ate 531 
Glu He Gin Asp Tyr Glu He Val Asp Arg Asp Asp Thr Ser Val He 
155 160 165 



579 



627 



636 
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<210> 28 
<211> 203 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 28 

Met Lys Trp Leu Ser Arg lie Leu lie Val Val Gly lie Gly Phe Leu 
1 5 10 15 

lie Ala Phe Gly Tyr Thr He Tyr Asp His Ala Asn Ser Thr Ser Val 

20 25 30 

Thr Leu Glu Glu Ala Gin Val Ala Leu Glu Glu Ser Arg Ala Gin Ala 
35 40 45 

Ala Glu Ala Gly Asp Gly Asp Gin Asp Gly Gin Asp Gly Ala Ser Asp 
50 55 60 

He Asp He Gin Asn Tyr Gin Pro Glu Ala Gly Glu Ala Phe Gly Val 
65 70 75 80 

Leu Asp He Pro Lys Leu Asp Arg Ser He Gly He Val Ala Gly Thr 

85 90 95 



Asp Ala Asp Ser Leu Lys Lys Gly Val Gly His Val Glu Asn Thr Val 

100 105 HO 

Phe Pro Gly Gin Gly Glu Gin He Val Leu Ser Gly His Arg Asp Thr 
115 120 125 

Val Phe Arg Asp Phe Gly Glu Leu Glu He Gly Asp Asn Phe He Val 
130 135 140 

Gin Met Pro Tyr Gly Asp Tyr Glu Tyr Glu He Gin Asp Tyr Glu He 
145 150 155 160 

Val Asp Arg Asp Asp Thr Ser Val He Arg Pro Met Gly Glu Glu Val 

165 170 175 



Leu Val Val Ser Thr Cys Tyr Pro Phe Glu Phe Tyr Gly Phe Ala Pro 

180 185 190 



Asp Arg Phe Val Phe Tyr Cys Tyr Pro Val Glu 
195 200 
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<210> 29 
<211> 1290 
<212> DNA 

<213> Alloiococcus otitidis 



<220> 

<221> CDS 

<222> (1) . . (1290) 

<223> 



<400> 29 

atg cag tat gca gaa ctt ctt gac etc ctg ccc eta caa gaa caa ggg 

Met Gin Tyr Ala Glu Leu Leu Asp Leu Leu Pro Leu Gin Glu Gin Gly 
15 10 15 

aag atg gat ttg ggg eta gca acc atg acc cag gtg atg gac tta ttg 
Lys Met Asp Leu Gly Leu Ala Thr Met Thr Gin Val Met Asp Leu Leu 

20 25 30 

ggc aag ccc caa gac cag gtc ccc atg gtt cat ate get ggc acc aat 
Gly Lys Pro Gin Asp Gin Val Pro Met Val His lie Ala Gly Thr Asn 
35 40 45 

ggc aag ggg teg gec gca gec ttt aca gag cga ata etc agg gag get 
Gly Lys Gly Ser Ala Ala Ala Phe Thr Glu Arg lie Leu Arg Glu Ala 
50 55 60 

ggc tac aag gtc ggc ttg tat att tec cct tec eta gtg gaa ttt aat 
Gly Tyr Lys Val Gly Leu Tyr lie Ser Pro Ser Leu Val Glu Phe Asn 
65 70 75 80 

gaa egg ate caa ate aat ggc caa gec aca agt gat gat cag ttg etc 
Glu Arg lie Gin lie Asn Gly Gin Ala Thr Ser Asp Asp Gin Leu Leu 

85 90 95 

aag gca gtc aag acc eta age cag gee tta gaa ggc aca tec ctt tgc 
Lys Ala Val Lys Thr Leu Ser Gin Ala Leu Glu Gly Thr Ser Leu Cys 

100 105 110 

ctg act gaa ttt gaa ctt ttt act gec ctg gee ttt ttg acc ttc cag 
Leu Thr Glu Phe Glu Leu Phe Thr Ala Leu Ala Phe Leu Thr 1 Phe Gin 
115 120 125 



tta gat get acc aat gtg ata age cgt cct gee gtc acc gec att acc 
Leu Asp Ala Thr Asn Val lie Ser Arg Pro Ala Val Thr Ala lie Thr 
145 150 155 160 

aag att ggc atg gac cat acc get ttt tta ggg gat age ctg cca gaa 
Lys lie Gly Met Asp His Thr Ala Phe Leu Gly Asp Ser Leu Pro Glu 

165 170 175 



48 



96 



144 



192 



240 



288 



336 



384 



gac cag get tgt gat ata gec gtt gta gag gtc gga tta gga gga egg 432 
Asp Gin Ala Cys Asp lie Ala Val Val Glu Val Gly Leu Gly Gly Arg 
130 135 140 



480 



528 



ata gee ggt gag aag gca gee ate gec aaa gee ggc teg cct atg gtg 



576 
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He Ala Gly Glu Lys Ala Ala He Ala Lys Ala Gly Ser Pro Met Val 

180 185 190 

gtc tat ccc cag ggg cca gaa gtg act egg gtg ate caa aat cag gcg 
Val Tyr Pro Gin Gly Pro Glu Val Thr Arg Val He Gin Asn Gin Ala 
195 200 205 

gac egg gta gga gee tct ctg acc eta att tct caa tec gac ctg act 
Asp Arg Val Gly Ala Ser Leu Thr Leu He Ser Gin Ser Asp Leu Thr 
210 215 220 

tat aac ctg act teg gac etc ttg caa gac ttt gaa tac aag cag gtt 
Tyr Asn Leu Thr Ser Asp Leu Leu Gin Asp Phe Glu Tyr Lys Gin Val 
225 230 235 240 

ccc tac cgc att tea ctt tta gaa gat tat caa att tac aac gec ctg 
Pro Tyr Arg He Ser Leu Leu Glu Asp Tyr Gin He Tyr Asn Ala Leu 

245 250 255 

gta gca etc gaa ate tct ttt gee tta cag gat get ggc tgg cag att 
Val Ala Leu Glu He Ser Phe Ala Leu Gin Asp Ala Gly Trp Gin He 

260 265 270 

age cct aaa gec att aaa caa ggt ttg gtt gag acc cgc tgg ccc ggc 
Ser Pro Lys Ala He Lys Gin Gly Leu Val Glu Thr Arg Trp Pro Gly 
275 280 285 

cgt ttt gaa ctt ate gee tct cat ccg acc gtg ate gtt gat ggg tct 
Arg Phe Glu Leu He Ala Ser His Pro Thr Val He Val Asp Gly Ser 
290 295 300 

cat aat gaa gac ggc ctg cag get etc ttg get aac eta gac cgc tac 
His Asn Glu Asp Gly Leu Gin Ala Leu Leu Ala Asn Leu Asp Arg Tyr 
305 310 315 320 

ttt cca gaa caa aaaagg att ggg ate gta ggc atg ttg gec gac aag 
Phe Pro Glu Gin Lys Arg He Gly He Val Gly Met Leu Ala Asp Lys 

325 330 335 

gat gtt gat gee gec eta get cct tta acc aaa age ttt gac egg ctt 
Asp Val Asp Ala Ala Leu Ala Pro Leu Thr Lys Ser Phe Asp Arg Leu 

340 345 350 

tat acg gtg aca ccc gat teg ccg egg ggg atg gca gee cct caa atg 
Tyr Thr Val Thr Pro Asp Ser Pro Arg Gly Met Ala Ala Pro Gin Met 
355 360 365 

aaa gaa aaa ctg acc gaa atg gtg teg ccg tct act egg gtc ata get 
Lys Glu Lys Leu Thr Glu Met Val Ser Pro Ser Thr Arg Val He Ala 
370 375 380 

tgt gaa agt tat aac cag gec tta gac ctg gca ggt caa gta gee ggc 
Cys Glu Ser Tyr Asn Gin Ala Leu Asp Leu Ala Gly Gin Val Ala Gly 
385 390 395 400 

gga gat gac eta att gtc gtt ttt gga agt ttt tat att gtt ggt aag 
Gly Asp Asp Leu He Val Val Phe Gly Ser Phe Tyr He Val Gly Lys 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 
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405 410 415 

ttt aga cag ctg att tta gca aga aga aat ggg gaa gtt taa 1290 
Phe Arg Gin Leu He Leu Ala Arg Arg Asn Gly Glu Val 

420 425 



<210> 30 
<211> 429 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 30 

Met Gin Tyr Ala Glu Leu Leu Asp Leu Leu Pro Leu Gin Glu Gin Gly 
15 10 15 



Lys Met Asp Leu Gly Leu Ala Thr Met Thr Gin Val Met Asp Leu Leu 

20 25 30 



Gly Lys Pro Gin Asp Gin Val Pro Met Val His He Ala Gly Thr Asn 
35 40 45 



Gly Lys Gly Ser Ala Ala Ala Phe Thr Glu Arg He Leu Arg Glu Ala 
50 55 60 



Gly Tyr Lys Val Gly Leu Tyr He Ser Pro Ser Leu Val Glu Phe Asn 
65 70 - 75 80 



Glu Arg lie Gin He Asn Gly Gin Ala Thr Ser Asp Asp Gin Leu Leu 

85 90 95 



Lys Ala Val Lys Thr Leu Ser Gin Ala Leu Glu Gly Thr Ser Leu Cys 

100 105 110 



Leu Thr Glu Phe Glu Leu Phe Thr Ala Leu Ala Phe Leu Thr Phe Gin 
115 120 125 



Asp Gin Ala Cys Asp He Ala Val Val Glu Val Gly Leu Gly Gly Arg 
130 135 140 



Leu Asp Ala Thr Asn Val He Ser Arg Pro Ala Val Thr Ala He Thr 
145 150 155 160 



Lys He Gly Met Asp His Thr Ala Phe Leu Gly Asp Ser Leu Pro Glu 

165 170 175 
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He Ala Gly Glu Lys Ala Ala He Ala Lys Ala Gly Ser Pro Met Val 

180 185 190 



Val Tyr Pro Gin Gly Pro Glu Val Thr Arg Val He Gin Asn Gin Ala 
195 200 205 



Asp Arg Val Gly Ala Ser Leu Thr Leu He Ser Gin Ser Asp Leu Thr 
210 215 220 



Tyr Asn Leu Thr Ser Asp Leu Leu Gin Asp Phe Glu Tyr Lys Gin Val 
225 230 235 240 



Pro Tyr Arg lie Ser Leu Leu Glu Asp Tyr Gin He Tyr Asn Ala Leu 

245 250 255 



Val Ala Leu Glu He Ser Phe Ala Leu Gin Asp Ala Gly Trp Gin He 

260 265 270 



Ser Pro Lys Ala He Lys Gin Gly Leu Val Glu Thr Arg Trp Pro Gly 
275 280 285 



Arg Phe Glu Leu He Ala Ser His Pro Thr Val He Val Asp Gly Ser 
290 295 300 



His Asn Glu Asp Gly Leu Gin Ala Leu Leu Ala Asn Leu Asp Arg Tyr 
305 310 315 320 



Phe Pro Glu Gin Lys Arg He Gly He Val Gly Met Leu Ala Asp Lys 

325 330 335 



Asp Val Asp Ala Ala Leu Ala Pro Leu Thr Lys Ser Phe Asp Arg Leu 

340 345 350 



Tyr Thr Val Thr Pro Asp Ser Pro Arg Gly Met Ala Ala Pro Gin Met 
355 360 365 



Lys Glu Lys Leu Thr Glu Met Val Ser Pro Ser Thr Arg Val He Ala 
370 375 380 



Cys Glu Ser Tyr Asn Gin Ala Leu Asp Leu Ala Gly Gin Val Ala Gly 
385 390 395 400 



Gly Asp Asp Leu He Val Val Phe Gly Ser Phe Tyr He Val Gly Lys 



7S 
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405 410 415 



Phe Arg Gin Leu lie Leu Ala Arg Arg Asn Gly Glu Val 

420 425 



<210> 31 
<211> 387 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (16) . . (387) 

<223> 

<400> 31 

agaaagagga ataag atg gat aaa aga gat aag at a cgc ttg caa ggg atg 51 

Met Asp Lys Arg Asp Lys He Arg Leu Gin Gly Met 
1 5 10 

act ttt cac ggc cac cac ggt ttg atg gag gcc gaa acc aag ttg ggt 99 
Thr Phe His Gly His His Gly Leu Met Glu Ala Glu Thr Lys Leu Gly 
15 20 25 



cag att ttt aaa gtc gac ctt gtc tta gta act gac etc aag tta gcg 
Gin He Phe Lys Val Asp Leu Val Leu Val Thr Asp Leu Lys Leu Ala 
30 35 40 



147 



ggt caa aca gac aag atg ggg cac agt ate cac' tac ggg gaa gtt tat 195 
Gly Gin Thr Asp Lys Met Gly His Ser He His Tyr Gly Glu Val Tyr 
45 50 55 60 

gac ctg gtc aag tec att gtg gaa ggt acc ccc ttt aag ctt ttg gag 243 
Asp Leu Val Lys Ser He Val Glu Gly Thr Pro Phe Lys Leu Leu Glu 

65 70 75 

tec ttg gcg gaa acc eta gcc caa gaa gtt etc aag act ttt gac cag 291 
Ser Leu Ala Glu Thr Leu Ala Gin Glu Val Leu Lys Thr Phe Asp Gin 

80 85 90 

gtt gag gag gtc ttg gtc egg gtc aac aaa ccc cag gcc ccg att cct 339 
Val Glu Glu Val Leu Val Arg Val Asn Lys Pro Gin Ala Pro He Pro 
95 100 105 

ggt gtc ttt gac aat gta gcg gtg gaa ate acc egg gcc cgt cac tag 387 
Gly Val Phe Asp Asn Val Ala Val Glu He Thr Arg Ala Arg His 
110 115 120 



<210> 32 
<211> 123 
<212> PRT 

<213> Alloiococcus otitidis 



<400> 32 
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Met Asp Lys Arg Asp Lys He Arg Leu Gin Gly Met Thr Phe His Gly 
15 10 15 



His His Gly Leu Met Glu Ala Glu Thr Lys Leu Gly Gin He Phe Lys 

20 25 30 



Val Asp Leu Val Leu Val Thr Asp Leu Lys Leu Ala Gly Gin Thr Asp 
35 40 45 



Lys Met Gly His Ser He His Tyr Gly Glu Val Tyr Asp Leu Val Lys 
50 55 60 



Ser He Val Glu Gly Thr Pro Phe Lys Leu Leu Glu Ser Leu Ala Glu 
65 70 75 80 



Thr Leu Ala Gin Glu Val Leu Lys Thr Phe Asp Gin Val Glu Glu Val 

85 90 95 



Leu Val Arg Val Asn Lys Pro Gin Ala Pro He Pro Gly Val Phe Asp 

100 105 110 



Asn Val Ala Val Glu He Thr Arg Ala Arg His 
115 120 



<210> 33 
<211> 552 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (22) . . (552) 

<223> 

<400> 33 

ataggtaagg aggaatatag a gtg aag ggt gtt atg ata gga etc ggt tct 51 

Met Lys Gly Val Met He Gly Leu Gly Ser 
15 10 

aat atg ggg act aag ttg get tac tta aac egg get ttg gee aaa ata 99 
Asn Met Gly Thr Lys Leu Ala Tyr Leu Asn Arg Ala Leu Ala Lys He 

15 20 25 

aat age eta gac cag gta gca gtc aag caa gtt tea aag gtt tac cag 147 
Asn Ser Leu Asp Gin Val Ala Val Lys Gin Val Ser Lys Val Tyr Gin 

30 35 40 



act gaa ccg gtg ggc tac aag gac cag gac gat ttt tac aat atg gtt 
Thr Glu Pro Val Gly Tyr Lys Asp Gin Asp Asp Phe Tyr Asn Met Val 



195 
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45 50 55 

get ggc ctt gaa att gaa cca ggc aag acc ccc ttg gac etc tta gaa 

Ala Gly Leu Glu lie Glu Pro Gly Lys Thr Pro Leu Asp Leu Leu Glu 
60 65 70 

gac ttg ctg gcg att gag gca gac ctg gac agg aag egg acc att aaa 

Asp Leu Leu Ala He Glu Ala Asp Leu Asp Arg Lys Arg Thr He Lys 
75 80 85 90 



gaa att gac cat ccc aag etc caa gtt ccc cac cca agg etc cag gac 
Glu He Asp His Pro Lys Leu Gin Val Pro His Pro Arg Leu Gin Asp 

110 115 120 

egg gec ttt gtc ttg gtc ccc ttg get gag ttg gac ccc aac tac ctg 
Arg Ala Phe Val Leu Val Pro Leu Ala Glu Leu Asp Pro Asn Tyr Leu 
125 130 135 

gtt cct ggc ata gat aag aca gtt gcg gac ttg ttg get tct tta aac 
Val Pro Gly He Asp Lys Thr Val Ala Asp Leu Leu Ala Ser Leu Asn 
140 145 150 



243 



291 



aat ggc ccc cga acc att gac ttg gat gtc ttg ctg gtg gag ggt caa 339 
Asn Gly Pro Arg Thr He Asp Leu Asp Val Leu Leu Val Glu Gly Gin 

95 100 105 



387 



435 



483 



caa acc gac eta gca ggg gtg gag get ttg ggt cag ttg acg aac eta 531 
Gin Thr Asp Leu Ala Gly Val Glu Ala Leu Gly Gin Leu Thr Asn Leu 
155 - 160 165 170 

tta gaa gac cgt gag get tga 552 
Leu Glu Asp Arg Glu Ala 

175 



<210> 34 
<211> 176 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 34 

Met Lys Gly Val Met He Gly Leu Gly Ser Asn Met Gly Thr Lys Leu 
15 10 15 

Ala Tyr Leu Asn Arg Ala Leu Ala Lys He Asn Ser Leu Asp Gin Val 

20 25 30 



Ala Val Lys Gin Val Ser Lys Val Tyr Gin Thr Glu Pro Val Gly Tyr 
35 40 45 



Lys Asp Gin Asp Asp Phe Tyr Asn Met Val Ala Gly Leu Glu He Glu 
50 55 60 
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Pro Gly Lys Thr Pro Leu Asp Leu Leu Glu Asp Leu Leu Ala lie Glu 
65 70 75 80 

Ala Asp Leu Asp Arg Lys Arg Thr lie Lys Asn Gly Pro Arg Thr lie 

85 90 95 



Asp Leu Asp Val Leu Leu Val Glu Gly Gin Glu lie Asp His Pro Lys 

100 105 110 



Leu Gin Val Pro His Pro Arg Leu Gin Asp Arg Ala Phe Val Leu Val 
115 120 125 



Pro Leu Ala Glu Leu Asp Pro Asn Tyr Leu Val Pro Gly lie Asp Lys 
130 135 140 

Thr Val Ala Asp Leu Leu Ala Ser Leu Asn Gin Thr Asp Leu Ala Gly 
145 150 155 ISO 

Val Glu Ala Leu Gly Gin Leu Thr Asn Leu Leu Glu Asp Arg Glu Ala 

165 170 175 



<210> 35 
<211> 1242 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (40) . . (1242) 

<223> 

<400> 35 

aatcttctta atatcgcttg gcccaagacc gctataata gtg gta agt gat tat 

Met Val Ser Asp Tyr 
1 5 

ttt agg agg ttc aat atg caa ata gga att gac aag ctg get ttt gcg 
Phe Arg Arg Phe Asn Met Gin lie Gly lie Asp Lys Leu Ala Phe Ala 

10 15 20 

act cca acc agg tac ttg gaa atg gcg agt ctg gec caa gec egg tec 
Thr Pro Thr Arg Tyr Leu Glu Met Ala Ser Leu Ala Gin Ala Arg Ser 

25 30 35 

caa gac cct aat aaa tat ate aag ggg eta ggc caa gaa gec atg get 
Gin Asp Pro Asn Lys Tyr He Lys Gly Leu Gly Gin Glu Ala Met Ala 
40 45 50 



54 



102 



150 



198 
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gtc cct gaa gaa agt gat gat gcc gtc age ttg gcg get aat gee ggt 
Val Pro Glu Glu Ser Asp Asp Ala Val Ser Leu Ala Ala Asn Ala Gly 
55 60 65 

aat tta ate tta agt gaa gaa gac aag get get att gac atg gtg ata 
Asn Leu lie Leu Ser Glu Glu Asp Lys Ala Ala lie Asp Met Val lie 
70 75 80 85 

gtc ggt acc gaa tct 'ggg gtc gac cag tec aag teg gca gcc age tgg 
Val Gly Thr Glu Ser Gly Val Asp Gin Ser Lys Ser Ala Ala Ser Trp 

90 95 100 

gtt cat gac ctg ttg ggg ate aac ccc cat get aga age ctg gag ate 
Val His Asp Leu Leu Gly lie Asn Pro His Ala Arg Ser Leu Glu lie 

105 110 115 

aag caa gcc tgc tac ggg get acg get gga etc aaa eta get gtg gcc 
Lys Gin Ala Cys Tyr Gly Ala Thr Ala Gly Leu Lys Leu Ala Val Ala 
120 125 130 

cac eta gcc tta aac cct gac tec aag gtt tta gtc ate ggt tea gac 
His Leu Ala Leu Asn Pro Asp Ser Lys Val Leu Val lie Gly Ser Asp 
135 140 145 

ata gcc aag tat ggt ttg gaa aca ggg ggc gag ccc act caa gga get 
He Ala Lys Tyr Gly Leu Glu Thr Gly Gly Glu Pro Thr Gin Gly Ala 
150 155 160 165 

ggg gcg gtc gcc ate tta gtc age cgt gac cct gca att get gtg gtc 
Gly Ala Val Ala He Leu Val Ser Arg Asp Pro Ala He Ala Val Val 

170 175 180 

aac aat gac agt gcc atg ctg acc aaa aat att gca gac ttt tgg cga 
Asn Asn Asp Ser Ala Met Leu Thr Lys Asn He Ala Asp Phe Trp Arg 

185 190 195 

ccc aac tac age gat tat gcc cat gta gat ggc aag ttc tec aac cag 
Pro Asn Tyr Ser Asp Tyr Ala His Val Asp Gly Lys Phe Ser Asn Gin 
200 205 210 

gca tac ttg tec aac eta gca gaa gtc tgg cgc cag tat aag ate aaa 
Ala Tyr Leu Ser Asn Leu Ala Glu Val Trp Arg Gin Tyr Lys He Lys 
215 220 225 

aac cag ctg tct get aag gat ttc aag gcc atg gtc ttc cac age ccc 
Asn Gin Leu Ser Ala Lys Asp Phe Lys Ala Met Val Phe His Ser Pro 
230 235 240 245 

tat acc aag atg ggg aaa aag gcc tta etc aaa eta gga gat tat gaa 
Tyr Thr Lys Met Gly Lys Lys Ala Leu Leu Lys Leu Gly Asp Tyr Glu 

250 255 260 

gac cag aaa gag att gac cgc ttg ctg gcc tat tac gag cct ggt cgc 
Asp Gin Lys Glu He Asp Arg Leu Leu Ala Tyr Tyr Glu Pro Gly Arg 

265 270 275 
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tac tac aat aag egg gtc ggt aat ate tat act ggg tct ctt tac ttg 918 
Tyr Tyr Asn Lys Arg Val Gly Asn lie Tyr Thr Gly Ser Leu Tyr Leu 
280 285 290 

agt ttg att tec etc tta gac cag gta agt gac ctg gag get ggc gac 966 

Ser Leu lie Ser Leu Leu Asp Gin Val Ser Asp Leu Glu Ala Gly Asp 

295 300 305 

egg att ggc etc tat tct tat ggg tct ggt gee gtt gga gag ttc ttt 1014 

Arg He Gly Leu Tyr Ser Tyr Gly Ser Gly Ala Val Gly Glu Phe Phe 
310 315 320 325 

age att egg etc cag cca ggt tac aag gaa age tta cag caa gtt gac 1062 

Ser He Arg Leu Gin Pro Gly Tyr Lys Glu Ser Leu Gin Gin Val Asp 

330 335 340 

ttc gac cag gtt gtc aac cag cgt tea gca tta gag atg tac age tat 1110 

Phe Asp Gin Val Val Asn Gin Arg Ser Ala Leu Glu Met Tyr Ser Tyr 

345 350 355 

cag gac ttg ctg acc ttt age eta cct caa gac ggc caa act tac act 1158 

Gin Asp Leu Leu Thr Phe Ser Leu Pro Gin Asp Gly Gin Thr Tyr Thr 
360 365 370 

aca gat aaa agt cac cag gtc cca ggc cgt ttt gtc tta gac egg gtg 1206 

Thr Asp Lys Ser His Gin Val Pro Gly Arg Phe Val Leu Asp Arg Val 

375 380 385 

gee gac cat ate cgt tac tac egg cgc ttg get taa 1242 
Ala Asp His He Arg Tyr Tyr Arg Arg Leu Ala 
390 395 400 



<210> 36 
<211> 400 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 36 

Met Val Ser Asp Tyr Phe Arg Arg Phe Asn Met Gin He Gly He Asp 
15 10 15 



Lys Leu Ala Phe Ala Thr Pro Thr Arg Tyr Leu Glu Met Ala Ser Leu 

20 25 30 



Ala Gin Ala Arg Ser Gin Asp Pro Asn Lys Tyr He Lys Gly Leu Gly 
35 40 45 



Gin Glu Ala Met Ala Val Pro Glu Glu Ser Asp Asp Ala Val Ser Leu 
50 55 60 



Ala Ala Asn Ala Gly Asn Leu He Leu Ser Glu Glu Asp Lys Ala Ala 
65 70 75 80 
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He Asp Met Val He Val Gly Thr Glu Ser Gly Val Asp Gin Ser Lys 

85 90 95 



Ser Ala Ala Ser Trp Val His Asp Leu Leu Gly He Asn Pro His Ala 

100 105 110 



Arg Ser Leu Glu He Lys Gin Ala Cys Tyr Gly Ala Thr Ala Gly Leu 
115 120 125 



Lys Leu Ala Val Ala His Leu Ala Leu Asn Pro Asp Ser Lys Val Leu 
130 135 140 



Val He Gly Ser Asp He Ala Lys Tyr Gly Leu Glu Thr Gly Gly Glu 
145 150 155 160 



Pro Thr Gin Gly Ala Gly Ala Val Ala He Leu Val Ser Arg Asp Pro 

165 170 175 



Ala He Ala Val Val Asn Asn Asp Ser Ala Met Leu Thr Lys Asn He 

180 185 190 



Ala Asp Phe Trp Arg Pro Asn Tyr Ser Asp Tyr Ala His Val Asp Gly 
195 200 205 



Lys Phe Ser Asn Gin Ala Tyr Leu Ser Asn Leu Ala Glu Val Trp Arg 
210 215 220 



Gin Tyr Lys He Lys Asn Gin Leu Ser Ala Lys Asp Phe Lys Ala Met 
225 230 235 240 



Val Phe His Ser Pro Tyr Thr Lys Met Gly Lys Lys Ala Leu Leu Lys 

245 250 255 



Leu Gly Asp Tyr Glu Asp Gin Lys Glu He Asp Arg Leu Leu Ala Tyr 

260 265 270 



Tyr Glu Pro Gly Arg Tyr Tyr Asn Lys Arg Val Gly Asn He Tyr Thr 
275 280 285 



Gly Ser Leu Tyr Leu Ser Leu He Ser Leu Leu Asp Gin Val Ser Asp 
290 295 300 
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Leu Glu Ala Gly Asp Arg lie Gly Leu Tyr Ser Tyr Gly Ser Gly Ala 
305 310 315 320 



Val Gly Glu Phe Phe Ser lie Arg Leu Gin Pro Gly Tyr Lys Glu Ser 

325 330 335 



Leu Gin Gin Val Asp Phe Asp Gin Val Val Asn Gin Arg Ser Ala Leu 

340 345 350 



Glu Met Tyr Ser Tyr Gin Asp Leu Leu Thr Phe Ser Leu Pro Gin Asp 
355 360 365 



Gly Gin Thr Tyr Thr Thr Asp Lys Ser His Gin Val Pro Gly Arg Phe 
370 375 380 



Val Leu Asp Arg Val Ala Asp His lie Arg Tyr Tyr Arg Arg Leu Ala 
385 390 395 400 



<210> 37 
<211> 1323 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (31) - - (1323) 

<223> 

<400> 37 

ttctggtata gattaaggaa ggaggagacc atg tta ccc tta ttc aag caa ttt 54 

Met Leu Pro Leu Phe Lys Gin Phe 
1 5 



tac aag caa age etc age cag cgc etc aaa get eta gaa aag gee ggc 
Tyr Lys Gin Ser Leu Ser Gin Arg Leu Lys Ala Leu Glu Lys Ala Gly 
10 15 20 



102 



tat ctt gat cct gac cag gcg ggt aaa etc cag tea ggg gaa ctg ggt 150 
Tyr Leu Asp Pro Asp Gin Ala Gly Lys Leu Gin Ser Gly Glu Leu Gly 
25 30 35 40 

ttg ace cat gaa gec ggc gac cac atg att gaa aac tac ate ggc tec 198 
Leu Thr His Glu Ala Gly Asp His Met lie Glu Asn Tyr lie Gly Ser 

45 50 55 



tat acc etc cct ctg gga ctg gee etc cac ttt tta etc gat ggc aag 
Tyr Thr Leu Pro Leu Gly Leu Ala Leu His Phe Leu Leu Asp Gly Lys 



246 
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60 65 70 

age tac eta gtc ccc atg get att gaa gag ccc tct gtc att gec get 
Ser Tyr Leu Val Pro Met Ala lie Glu Glu Pro Ser Val lie Ala Ala 
75 80 85 

gee age aac ggt gec aag atg gta gee caa age ggt ggt ttc cat aca 
Ala Ser Asn Gly Ala Lys Met Val Ala Gin Ser Gly Gly Phe His Thr 
90 95 100 

gtc aag gaa aac egg ctg atg ate ggt caa gtg gtc ata gec gga age 
Val Lys Glu Asn Arg Leu Met lie Gly Gin Val Val lie Ala Gly Ser 
105 110 115 120 

aca aaa cct age cag gac egg gga aaa ate ctg age cac cag caa gac 
Thr Lys Pro Ser Gin Asp Arg Gly Lys lie Leu Ser His Gin Gin Asp 

125 130 135 

tta ate gac eta gec aat get age tat ccc tea att ggt aaa aga ggg 
Leu lie Asp Leu Ala Asn Ala Ser Tyr Pro Ser lie Gly Lys Arg Gly 

140 145 150 



cag gat atg gga age tat ctg gca gtc tac ttg act gtt gac tgc cag 
Gin Asp Met Gly Ser Tyr Leu Ala Val Tyr Leu Thr Val Asp Cys Gin 
170 175 180 

gaa gee atg ggg get aac att ate aac ace atg ctg gaa gec ctg get 
Glu Ala Met Gly Ala Asn lie He Asn Thr Met Leu Glu Ala Leu Ala 
185 190 195 200 

cct gaa att gac cgc eta acc age ggc cag gtc ttg atg tec ate tta 
Pro Glu He Asp Arg Leu Thr Ser Gly Gin Val Leu Met Ser He Leu 

205 210 215 

tct aac ctg gee act gaa tec ctt gtc act gtt tec tgt caa gta aaa 
Ser Asn Leu Ala Thr Glu Ser Leu Val Thr Val Ser Cys Gin Val Lys 

220 225 230 

ccc aga ttt tta gtc aaa aat gac atg gca ggg gaa get gtc egg gac 
Pro Arg Phe Leu Val Lys Asn Asp Met Ala Gly Glu Ala Val Arg Asp 
235 240 245 

caa ate ate cag gec tac cag tat gee tgc ctg gac ccc tac egg gca 
Gin He He Gin Ala Tyr Gin Tyr Ala Cys Leu Asp Pro Tyr Arg Ala 
250 255 260 

gee acc cac aac aag ggg ate atg aac ggg gta gac ggc ttg gtc eta 
Ala Thr His Asn Lys Gly He Met Asn Gly Val Asp Gly Leu Val Leu 
265 270 275 280 

get agt ggg aat gat tgg egg gca ate gaa gcg ggg gee cat get tac 
Ala Ser Gly Asn Asp Trp Arg Ala He Glu Ala Gly Ala His Ala Tyr 

285 290 295 



294 



342 



390 



438 



486 



ggt ggg gee cga ggc att caa gtc aaa cag ttt gac tea gac ctg ggc 534 
Gly Gly Ala Arg Gly He Gin Val Lys Gin Phe Asp Ser Asp Leu Gly 
155 160 165 



582 



630 



678 



726 



774 



822 



870 



918 
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get agt ttg acc ggc cac tac cgc ccc ttg tec aag tgg gaa aag acc 
Ala Ser Leu Thr Gly His Tyr Arg Pro Leu Ser Lys Trp Glu Lys Thr 

300 305 310 

caa gac gga cag tta aaa ggg acc att acc ctt ccc ttg cca att gec 
Gin Asp Gly Gin Leu Lys Gly Thr lie Thr Leu Pro Leu Pro He Ala 
315 320 325 

aca gtt ggt ggg get att gec tec cac cct gta gec caa gtt age cag 
Thr Val Gly Gly Ala lie Ala Ser His Pro Val Ala Gin Val Ser Gin 
330 335 340 

caa ate tta ggc caa cct act get aag caa tta gec egg ctg gtt gca 
Gin He Leu Gly Gin Pro Thr Ala Lys Gin Leu Ala Arg Leu Val Ala 
345 350 355 360 

gca gtg gga eta gec cag aac eta tec get ctt cgt gec tta gtc aca 
Ala Val Gly Leu Ala Gin Asn Leu Ser Ala Leu Arg Ala Leu Val Thr 

365 370 375 

act ggt att caa caa gga cac atg gee etc cag gca agg tct ttg gee 
Thr Gly He Gin Gin Gly His Met Ala Leu Gin Ala Arg Ser Leu Ala 

380 385 390 

atg aat gee ggg gee egg gga gac aag ate caa aag ctg gca gac cgc 
Met Asn Ala Gly Ala Arg Gly Asp Lys He Gin Lys Leu Ala Asp Arg 
395 400 405 

tta att aac caa gac caa atg aac eta gca act gee cgt gee ctg etc 
Leu He Asn Gin Asp Gin Met Asn Leu Ala Thr Ala Arg Ala Leu Leu 
410 415 420 

aag aac atg gaa gaa gac taa 
Lys Asn Met Glu Glu Asp 
425 430 



<210> 38 
<211> 430 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 38 

Met Leu Pro Leu Phe Lys Gin Phe Tyr Lys Gin Ser Leu Ser Gin Arg 
15 10 15 



Leu Lys Ala Leu Glu Lys Ala Gly Tyr Leu Asp Pro Asp Gin Ala Gly 

20 25 30 



Lys Leu Gin Ser Gly Glu Leu Gly Leu Thr His Glu Ala Gly Asp His 
35 40 45 



966 



1014 



1062 



1110 



1158 



1206 



1254 



1302 



1323 



Met He Glu Asn Tyr He Gly Ser Tyr Thr Leu Pro Leu Gly Leu Ala 
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50 55 60 



Leu His Phe Leu Leu Asp Gly Lys Ser Tyr Leu Val Pro Met Ala lie 
65 70 75 80 



Glu Glu Pro Ser Val lie Ala Ala Ala Ser Asn Gly Ala Lys Met Val 

85 90 95 



Ala Gin Ser Gly Gly Phe His Thr Val Lys Glu Asn Arg Leu Met lie 

100 105 110 



Gly Gin Val Val lie Ala Gly Ser Thr Lys Pro Ser Gin Asp Arg Gly 
115 120 125 



Lys He Leu Ser His Gin Gin Asp Leu He Asp Leu Ala Asn Ala Ser 
130 135 140 



Tyr Pro Ser He Gly Lys Arg Gly Gly Gly Ala Arg Gly He Gin Val 
145 150 155 160 



Lys Gin Phe Asp Ser Asp Leu Gly Gin Asp Met Gly Ser Tyr Leu Ala 

165 170 175 



Val Tyr Leu Thr Val Asp Cys Gin Glu Ala Met Gly Ala Asn He He 

180 185 190 



Asn Thr Met Leu Glu Ala Leu Ala Pro Glu He Asp Arg Leu Thr Ser 
195 200 205 



Gly Gin Val Leu Met Ser He Leu Ser Asn Leu Ala Thr Glu Ser Leu 
210 215 220 



Val Thr Val Ser Cys Gin Val Lys Pro Arg Phe Leu Val Lys Asn Asp 
225 230 235 240 



Met Ala Gly Glu Ala Val Arg Asp Gin He He Gin Ala Tyr Gin Tyr 

245 250 255 



Ala Cys Leu Asp Pro Tyr Arg Ala Ala Thr His Asn Lys Gly He Met 

260 265 270 



Asn Gly Val Asp Gly Leu Val Leu Ala Ser Gly Asn Asp Trp Arg Ala 
275 280 . 285 
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lie Glu Ala Gly Ala His Ala Tyr Ala Ser Leu Thr Gly His Tyr Arg 
290 295 300 



Pro Leu Ser Lys Trp Glu Lys Thx Gin Asp Gly Gin Leu Lys Gly Thr 
305 310 315 320 



lie Thr Leu Pro Leu Pro lie Ala Thr Val Gly Gly Ala lie Ala Ser 

325 330 335 



His Pro Val Ala Gin Val Ser Gin Gin lie Leu Gly Gin Pro Thr Ala 

340 345 350 



Lys Gin Leu Ala Arg Leu Val Ala Ala Val Gly Leu Ala Gin Asn Leu 
355 360 365 



Ser Ala Leu Arg Ala Leu Val Thr Thr Gly He Gin Gin Gly His Met 
370 375 380 



Ala Leu Gin Ala Arg Ser Leu Ala Met Asn Ala Gly Ala Arg Gly Asp 
385 390 395 400 



Lys He Gin Lys Leu Ala Asp Arg Leu He Asn Gin Asp Gin Met Asn 

405 410 415 



Leu Ala Thr Ala Arg Ala Leu Leu Lys Asn Met Glu Glu Asp 

420 425 430 



<210> 39 
<211> 930 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (13) . . (930) 

<223> 

<400> 39 

aggattagta aa atg tta ttt gat cgt ate gta gaa gec ttt ccc gaa age 51 

Met Leu Phe Asp Arg He Val Glu Ala Phe Pro Glu Ser 
15 10 

aac ate aaa aaa gat gaa ccc ttg tec tat tac tct tac act cga aca 99 
Asn He Lys Lys Asp Glu Pro Leu Ser Tyr Tyr Ser Tyr Thr Arg Thr 
15 20 25 
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ggt ggc ccg get gac att ttg att ttc cca gaa tec ate gat gaa att 147 
Gly Gly Pro Ala Asp He Leu He Phe Pro Glu Ser He Asp Glu He 
30 35 40 45 



gtg acg att ate aag tgg ate aac caa agt ccg gaa tac caa get ggc 
Val Thr He He Lys Trp He Asn Gin Ser Pro Glu Tyr Gin Ala Gly 

50 55 60 

gat etc ccc etc act ate tta ggc aat get age aac ctg ate gta aaa 
Asp Leu Pro Leu Thr He Leu Gly Asn Ala Ser Asn Leu He Val Lys 

65 70 75 

gat ggt ggg ata aga ggg att acc ate att acc acc ggc att aaa acc 
Asp Gly Gly He Arg Gly He Thr He He Thr Thr Gly He Lys Thr 
80 85 90 

att tgt cac gaa gag aac egg ate act gcg ggc get gga gca get att 
He Cys His Glu Glu Asn Arg He Thr Ala Gly Ala Gly Ala Ala He 
95 100 105 

ate gat gtt age' cag get gee ttg gac cat age tta act ggc ttg gaa 
He Asp Val Ser Gin Ala Ala Leu Asp His Ser Leu Thr Gly Leu Glu 
HO 115 120 125 



get ggg get tac ggt ggg gaa gtc cag cat tgt gtt gaa agt gtc caa 
Ala Gly Ala Tyr Gly Gly Glu Val Gin His Cys Val Glu Ser Val Gin 

145 150 155 



195 



243 



291 



339 



387 



ttc get tgt ggc ata ccg ggt agt aca ggc ggg get gtt tac atg aac 435 
Phe Ala Cys Gly He Pro Gly Ser Thr Gly Gly Ala Val Tyr Met Asn 

130 135 140 



483 



gtc ttg acc egg cat ggc cag ttg aag acc tat agt aat gcg gaa atg 531 
Val Leu Thr Arg His Gly Gin Leu Lys Thr Tyr Ser Asn Ala Glu Met 
160 165 170 

aac ttc tec tac cgc cac agt tat ttg atg gaa gaa gac gat ata gta 579 
Asn Phe Ser Tyr Arg His Ser Tyr Leu Met Glu Glu Asp Asp He Val 
175 180 185 

gtc tec gtg acc ttt aaa ttg gag teg ggc gac tac ate act ate aag 627 
Val Ser Val Thr Phe Lys Leu Glu Ser Gly Asp Tyr He Thr He Lys 
190 195 200 205 

gaa aag atg gat gaa tta acc tac ctt aga gaa tec aaa caa ccg ctg 
Glu Lys Met Asp Glu Leu Thr Tyr Leu Arg Glu Ser Lys Gin Pro Leu 

210 215 220 

gaa tac ccc tct tgt ggg tea gtc ttt aaa aga cct gaa ggc cac ttt 723 
Glu Tyr Pro Ser Cys Gly Ser Val Phe Lys Arg Pro Glu Gly His Phe 

one -> -> A *51 C 



675 



771 



225 230 235 

acg ggg aaa tta ate cag gat get ggc ctt caa gga ttg gtc cat ggt 

Thr Gly Lys Leu He Gin Asp Ala Gly Leu Gin Gly Leu Val His Gly 
240 245 250 

gga gec cag gta tec gaa aaa cat gee ggt ttt ate att aat ata ggc 819 



WO 03/104391 
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Gly Ala Gin Val Ser Glu Lys His Ala Gly Phe He He Asn He Gly 
255 260 265 

aat get acc gec age gac tac caa gag ttg ate caa cat ate caa gaa 
Asn Ala Thr Ala Ser Asp Tyr Gin Glu Leu He Gin His He Gin Glu 
270 275 280 285 

gaa gtc tac egg att tac aag gtt aag ctg gaa cgt gaa gtt cgc att 
Glu Val Tyr Arg He Tyr Lys Val Lys Leu Glu Arg Glu Val Arg He 

290 295 300 

ata ggg gag gat tag 
He Gly Glu Asp 

305 



<210> 40 
<211> 305 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 40 

Met Leu Phe Asp Arg He Val Glu Ala Phe Pro Glu Ser Asn He Lys 
!" 5 10 15 

Lys Asp Glu Pro Leu Ser Tyr Tyr Ser Tyr Thr Arg Thr Gly Gly Pro 

20 25 30 



Ala Asp He Leu He Phe Pro Glu Ser He Asp Glu He Val Thr He 
35 40 45 



He Lys Trp He Asn Gin Ser Pro Glu Tyr Gin Ala Gly Asp Leu Pro 
50 55 60 



Leu Thr He Leu Gly Asn Ala Ser Asn Leu He Val Lys Asp Gly Gly 
65 70 75 80 

He Arg Gly He Thr He He Thr Thr Gly He Lys Thr He Cys His 

85 90 95 



Glu Glu Asn Arg He Thr Ala Gly Ala Gly Ala Ala He He Asp Val 

100 105 110 



Ser Gin Ala Ala Leu Asp His Ser Leu Thr Gly Leu Glu Phe Ala Cys 
115 120 125 



867 



915 



930 



Gly He Pro Gly Ser Thr Gly Gly Ala Val Tyr Met Asn Ala Gly Ala 
130 135 140 
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Tyr Gly Gly Glu Val Gin His Cys Val Glu Ser Val Gin Val Leu Thr 
145 150 155 160 

Arg His Gly Gin Leu Lys Thr Tyr Ser Asn Ala Glu Met Asn Phe Ser 

165 170 175 

Tyr Arg His Ser Tyr Leu Met Glu Glu Asp Asp lie Val Val Ser Val 

180 185 190 



Thr Phe Lys Leu Glu Ser Gly Asp Tyr lie Thr lie Lys Glu Lys Met 
195 200 205 



Asp Glu Leu Thr Tyr Leu Arg Glu Ser Lys Gin Pro Leu Glu Tyr Pro 
210 215 220 

Ser Cys Gly Ser Val Phe Lys Arg Pro Glu Gly His Phe Thr Gly Lys 
225 230 235 240 

Leu He Gin Asp Ala Gly Leu Gin Gly Leu Val His Gly Gly Ala Gin 

245 250 255 



Val Ser Glu Lys His Ala Gly Phe He He Asn He Gly Asn Ala Thr 

260 265 270 



Ala Ser Asp Tyr Gin Glu Leu He Gin His He Gin Glu Glu Val Tyr 
275 280 285 

Arg He Tyr Lys Val Lys Leu Glu Arg Glu Val Arg He He Gly Glu 
290 295 300 



Asp 
305 



<210> 41 
<211> 1104 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (16) . . (1104) 

<223> 

<400> 41 

aaagctggtg ttttc atg gtt tat age tta agg att ccg ggg aaa ctt tat 



51 



WO 03/104391 
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Met Val Tyr Ser Leu Arg He Pro Gly Lys Leu Tyr 
1 5. 10 

ttg gca ggt gaa tac gca gta gta acc ccc ggc tat gcc ggg ate ttg 
Leu Ala Gly Glu Tyr Ala Val Val Thr Pro Gly Tyr Ala Gly He Leu 
15 20 25 

ctg aca gtc age egg tat ttg act tta gac att tgg gaa aca tct ccc 
Leu Thr Val Ser Arg Tyr Leu Thr Leu Asp He Trp Glu Thr Ser Pro 
30 35 40 

gac caa get tea gtc agg tct caa aca tat ggc aac cag gcc tat get 
Asp Gin Ala Ser Val Arg Ser Gin Thr Tyr Gly Asn Gin Ala Tyr Ala 
45 50 55 60 

tgg gag egg tta gat ggt ate ttt age ttt aag gac tgg tec cac ccc 
Trp Glu Arg Leu Asp Gly He Phe Ser Phe Lys Asp Trp Ser His Pro 

65 70 75 

ttc cac eta gtc gaa acg gtg ate caa aca gtg gaa gcc tac ata gaa 
Phe His Leu Val Glu Thr Val lie Gin Thr Val Glu Ala Tyr He Glu 

80 85 90 

tec ttg tec ctg cct tta aaa agt tac ggg att cag ate aag age cag 
Ser Leu Ser Leu Pro Leu Lys Ser Tyr Gly He Gin He Lys Ser Gin 
95 100 105 

ttg gac tac cag ggc aaa aaa att ggc ctg ggg tct agt ggg gcc gtt 
Leu Asp Tyr Gin Gly Lys Lys He Gly Leu Gly Ser Ser Gly Ala Val 
110 115 120 

acc ate get gtt ate cga ggc ctg age ctt ctt tac gac etc cac tta 
Thr He Ala Val lie Arg Gly Leu Ser Leu Leu Tyr Asp Leu His Leu 
125 130 135 140 

aaa gac ata gac att ttc aaa eta get gcc ate gcc cat ate cag eta 
Lys Asp He Asp He Phe Lys Leu Ala Ala He Ala His He Gin Leu 

145 150 155 



99 



147 



195 



243 



291 



339 



387 



43 5 



483 



aag age aag ggg tct ttt ggc gat ttg gca gcc tgc act tat act ggt 531 
Lys Ser Lys Gly Ser Phe Gly Asp Leu Ala Ala Cys Thr Tyr Thr Gly 

160 165 170 



579 



627 



gtg ate cgc tac cag tec ctg gat aga gaa tgg tta caa gaa caa ate 
Val He Arg Tyr Gin Ser Leu Asp Arg Glu Trp Leu Gin Glu Gin He 
175 180 185 

tec aac cat tec ate aag gac etc ctg gcc atg gat tgg cct age eta 
Ser Asn His Ser He Lys Asp Leu Leu Ala Met Asp Trp Pro Ser Leu 
190 195 200 

ggt eta gac egg etc age ctg ccc cat gac etc agg ctt tta ate gga 
Gly Leu Asp Arg Leu Ser Leu Pro His Asp Leu Arg Leu Leu lie Gly 
205 210 215 220 

tgg acc ggc cag cct gcc tec aca gaa aaa ttg gtt cag get gtc tac 723 
Trp Thr Gly Gin Pro Ala Ser Thr Glu Lys Leu Val Gin Ala Val Tyr 



675 
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225 230 235 

ccc caa aaa ata acc agg acc ccc ttg gac ttc cag tec ttc tta gac 
Pro Gin Lys lie Thr Arg Thr Pro Leu Asp Phe Gin Ser Phe Leu Asp 

240 245 250 

caa tec caa gag tgt gtc gac ggc ttg gtg gag tct tta age cag get 
Gin Ser Gin. Glu Cys Val Asp Gly Leu Val Glu Ser Leu Ser Gin Ala 
255 260 265 

gac tec cag gca age tta get tgg ate caa aag aac cga acc etc etc 
Asp Ser Gin Ala Ser Leu Ala Trp He Gin Lys Asn Arg Thr Leu Leu 
270 275 280 



acc tac ttg tgc gat att gtc gcg aaa tac gga ggc caa gec aag tct 
Thr Tyr Leu Cys Asp He Val Ala Lys Tyr Gly Gly Gin Ala Lys Ser 

305 310 315 

tec ggt gee ggc ggt gga gat tgt ggc att ggc eta ate aca agg gag 
Ser Gly Ala Gly Gly Gly Asp Cys Gly He Gly Leu He Thr Arg Glu 

320 325 330 

age cca ata gaa gec ate tac egg gaa tgg atg gat gca ggt ate ttg 
Ser Pro He Glu Ala lie Tyr Arg Glu Trp Met Asp Ala Gly He Leu 
335 340 345 

ccc tta aga eta gac att gta gaa aat ggt get tgc tat gac taa 
Pro Leu Arg Leu Asp He Val Glu Asn Gly Ala Cys Tyr Asp 
350 355 360 



<210> 42 
<211> 362 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 42 

Met Val Tyr Ser Leu Arg He Pro Gly Lys Leu Tyr Leu Ala Gly Glu 
15 10 15 

Tyr Ala Val Val Thr Pro Gly Tyr Ala Gly He Leu Leu Thr Val Ser 

20 25 30 

Arg Tyr Leu Thr Leu Asp He Trp Glu Thr Ser Pro Asp Gin Ala Ser 
35 40 45 



771 



819 



867 



aag gca atg ggc caa age egg ggg aaa gtc ate gaa acc aaa gee ttg 915 
Lys Ala Met Gly Gin Ser Arg Gly Lys Val He Glu Thr Lys Ala Leu 
285 290 295 300 



963 



1011 



1059 



1104 



Val Arg Ser Gin Thr Tyr Gly Asn Gin Ala Tyr Ala Trp Glu Arg Leu 
50 55 60 



WO 03/104391 PCT/US02/36122 
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Asp Gly He Phe Ser Phe Lys Asp Trp Ser His Pro Phe His Leu Val 
65 70 75 80 



Glu Thr Val He Gin Thr Val Glu Ala Tyr He Glu Ser Leu Ser Leu 

85 90 . 95 



Pro Leu Lys Ser Tyr Gly He Gin He Lys Ser Gin Leu Asp Tyr Gin 

100 105 HO 



Gly Lys Lys He Gly Leu Gly Ser Ser Gly Ala Val Thr He Ala Val 
115 120 125 



He Arg Gly Leu Ser Leu Leu Tyr Asp Leu His Leu Lys Asp He Asp 
130 135 140 

lie Phe Lys Leu Ala Ala He Ala His He Gin Leu Lys Ser Lys Gly 
145 150 155 160 

Ser Phe Gly Asp Leu Ala Ala Cys Thr Tyr Thr Gly Val He Arg Tyr 

165 170 175 



Gin Ser Leu Asp Arg Glu Trp Leu Gin Glu Gin He Ser Asn His Ser 

180 185 190 



He Lys Asp Leu Leu Ala Met Asp Trp Pro Ser Leu Gly Leu Asp Arg 
195 200 205 



Leu Ser Leu Pro His Asp Leu Arg Leu Leu He Gly Trp Thr Gly Gin 
210 215 220 

Pro Ala Ser Thr Glu Lys Leu Val Gin Ala Val Tyr Pro Gin Lys He 
225 230 235 240 

Thr Arg Thr Pro Leu Asp Phe Gin Ser Phe Leu Asp Gin Ser Gin Glu 

245 250 255 



Cys Val Asp Gly Leu Val Glu Ser Leu Ser Gin Ala Asp Ser Gin Ala 

260 265 270 



Ser Leu Ala Trp He Gin Lys Asn Arg Thr Leu Leu Lys Ala Met Gly 
275 280 285 



Gin Ser Arg Gly Lys Val He Glu Thr Lys Ala Leu Thr Tyr Leu Cys 
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290 295 300 

Asp He Val Ala Lys Tyr Gly Gly Gin Ala Lys Ser Ser Gly Ala Gly 
305 310 315 320 

Gly Gly Asp Cys Gly He Gly Leu He Thr Arg Glu Ser Pro He Glu 

325 330 335 

Ala He Tyr Arg Glu Trp Met Asp Ala Gly He Leu Pro Leu Arg Leu 

340 345 350 



Asp He Val Glu Asn Gly Ala Cys Tyr Asp 
355 360 



<210> 43 
<211> 1023 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (13) . . (1023) 

<223> 

<400> 43 

gagaagccaa cc atg act aag cag gcc ttt gaa aag aaa aag tta ggc egg 

Met Thr Lys Gin Ala Phe Glu Lys Lys Lys Leu Gly Arg 
1 5 10 

att tgc egg gcc cat acc aac att gcc ttg ate aag tac tgg ggt aag 
He Cys Arg Ala His Thr Asn He Ala Leu He Lys Tyr Trp Gly Lys 
15 20 25 

get gat agg gac ttg att ate ccc aat aac aac tec eta tct tta acc 
Ala Asp Arg Asp Leu He He Pro Asn Asn Asn Ser Leu Ser Leu Thr 



30 



35 40 45 



ttg gac get ttt tat acc gat acc cag gta- gtt ttt gac cca gac ttg 
Leu Asp Ala Phe Tyr Thr Asp Thr Gin Val Val Phe Asp Pro Asp Leu 

50 55 60 

gac cag gac caa tta tgg eta gac ggg aaa cag gaa aaa ggg tec gcc 
Asp Gin Asp Gin Leu Trp Leu Asp Gly Lys Gin Glu Lys Gly Ser Ala 

65 70 75 

tta acc aag gcc cag gtc ate ctg gac ttg gtt egg gac caa gcc cag 
Leu Thr Lys Ala Gin Val He Leu Asp Leu Val Arg Asp Gin Ala Gin 
80 85 90 

ctt gac tgg ccg gcc aaa att acc age cac aac caa gtt gcc act gca 
Leu Asp Trp Pro Ala Lys He Thr Ser His Asn Gin Val Ala Thr Ala 
95 100 105 



51 



99 



147 



195 



243 



291 



339 



WO 03/104391 



95/235 



PCT/US02/36122 



get ggc ttg get tec tct get tct ggt ctg gee gee ttg gcg ggt get 387 
Ala Gly Leu Ala Ser Ser Ala Ser Gly Leu Ala Ala Leu Ala Gly Ala 
110 115 120 125 



tea get gat get tta gac ctt ggc eta tec cca act gac etc tec cga 
Ser Ala Asp Ala Leu Asp Leu Gly Leu Ser Pro Thr Asp Leu Ser Arg 

130 135 140 



age gac cga cca aag gca att tec tec age caa ggc atg caa ttg ace 
Ser Asp Arg Pro Lys Ala lie Ser Ser Ser Gin Gly Met Gin Leu Thr 
190 195 200 205 

cag gag acg teg gac ttt tac cag gee tgg tta gac age ctg gac caa 
Gin Glu Thr Ser Asp Phe Tyr Gin Ala Trp Leu Asp Ser Leu Asp Gin 

210 215 220 



ctg gca gee aag ccc ccc ttc acc tat tgg act aaa gaa agt tta gec 
Leu Ala Ala Lys Pro Pro Phe Thr Tyr Trp Thr Lys Glu Ser Leu Ala 
255 260 265 

ctg atg cag gaa gta tgg gac egg cgc aag get ggc cag tec etc tac 
Leu Met Gin Glu Val Trp Asp Arg Arg Lys Ala Gly Gin Ser Leu Tyr 
270 275 280 285 



435 



ttg gee cgc agg gga tct ggg tct gee tea cga agt att ttt ggt ggt 483 

Leu Ala Arg Arg Gly Ser Gly Ser Ala Ser Arg Ser He Phe Gly Gly 

145 150 155 

ttt gtc gag tgg gaa aag ggt cat gat gat age tct tec ttt gee aag 531 

Phe Val Glu Trp Glu Lys Gly His Asp Asp Ser Ser Ser Phe Ala Lys 

160 165 170 

ccc ate gac ttg gec cag tgg gat att gee atg etc ttt gtc att gta 579 

Pro lie Asp Leu Ala Gin Trp Asp He Ala Met Leu Phe Val He Val 

175 180 185 



627 



675 



gac eta gca gac ate aag tec get ate caa gec caa gac etc gac cag 723 

Asp Leu Ala Asp He Lys Ser Ala He Gin Ala Gin Asp Leu Asp Gin 

225 230 235 

gtt ggg tec att gca gaa aga aat gee ctg aaa atg cat gee acc aac 771 

Val Gly Ser He Ala Glu Arg Asn Ala Leu Lys Met His Ala Thr Asn 
240 245 250 



819 



867 



ttc acc atg gac gee ggc ccc aat gtc aag gtt att ggc agg gaa get 915 
Phe Thr Met Asp Ala Gly Pro Asn Val Lys Val He Gly Arg Glu Ala 

290 295 300 

gac ctt aaa gee ttc aaa gca gac etc age caa gac tgg ccc gac aag 963 
Asp Leu Lys Ala Phe Lys Ala Asp Leu Ser Gin Asp Trp Pro Asp Lys 

305 310 315 

cat ctt gtc tta get aaa ccg ggt cca ggc ctg gec ttt att gat gga 1011 
His Leu Val Leu Ala Lys Pro Gly Pro Gly Leu Ala Phe He Asp Gly 
320 325 330 
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cct ttg aac tag 
Pro Leu Asn 
335 



<210> 44 
<211> 336 
<212> PRT 

<213> Alloiococcus otitidis 



1023 



<400> 44 ^ 
Met Thr Lys Gin Ala Phe Glu Lys Lys Lys Leu Gly Arg He Cys Arg 

* 10 15 



Ala His Thr Asn He Ala Leu He Lys Tyr Trp Gly Lys Ala Asp Arg 

20 25 30 

Asp Leu He He Pro Asn Asn Asn Ser Leu Ser Leu Thr Leu Asp Ala 
35 40 45 

Phe Tyr Thr Asp Thr Gin Val Val Phe Asp Pro Asp Leu Asp Gin Asp 
50 55 60 

Gin Leu Trp Leu Asp Gly Lys Gin Glu Lys Gly Ser Ala Leu Thr Lys 
65 70 75 80 

Ala Gin Val He Leu Asp Leu Val Arg Asp Gin Ala Gin Leu Asp Trp 

85 90 95 

Pro Ala Lys He Thr Ser His Asn Gin Val Ala Thr Ala Ala Gly Leu 

100 105 HO 

Ala Ser Ser Ala Ser Gly Leu Ala Ala Leu Ala Gly Ala Ser Ala Asp 
115 120 125 

Ala Leu Asp Leu Gly Leu Ser Pro Thr Asp Leu Ser Arg Leu Ala Arg 
130 135 140 

Arg Gly Ser Gly Ser Ala Ser Arg Ser He Phe Gly Gly Phe Val Glu 
145 150 155 160 

Trp Glu Lys Gly His Asp Asp Ser Ser Ser Phe Ala Lys Pro He Asp 

165 170 175 



Leu Ala Gin Trp Asp He Ala Met Leu Phe Val He Val Ser Asp Arg 

180 185 190 



WO 03/104391 



97/235 



PCT/US02/36122 



Pro Lys Ala lie Ser Ser Ser Gin Gly Met Gin Leu Thr Gin Glu Thr 
195 200 205 



Ser Asp Phe Tyr Gin Ala Trp Leu Asp Ser Leu Asp Gin Asp Leu Ala 
210 215 220 



Asp lie Lys Ser Ala lie Gin Ala Gin Asp Leu Asp Gin Val Gly Ser 
225 230 235 240 



He Ala Glu Arg Asn Ala Leu Lys Met His Ala Thr Asn Leu Ala Ala 

245 250 255 



Lys Pro Pro Phe Thr Tyr Trp Thr Lys Glu Ser Leu Ala Leu Met Gin 

260 265 270 



Glu Val Trp Asp Arg Arg Lys Ala Gly Gin Ser Leu Tyr Phe Thr Met 
275 280 285 



Asp Ala Gly Pro Asn Val Lys Val He Gly Arg Glu Ala Asp Leu Lys 
290 295 300 



Ala Phe Lys Ala Asp Leu Ser Gin Asp Trp Pro Asp Lys His Leu Val 
305 310 315 320 



Leu Ala Lys Pro Gly Pro Gly Leu Ala Phe He Asp Gly Pro Leu Asn 

325 330 335 



<210> 45 
<211> 981 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (28) . . (981) 

<223> 

<400> 45 

acaaaaatag acaaaggaga caaaagg atg acg ctt gtt aaa aat gta gcc aaa 

Met Thr Leu Val Lys Asn Val Ala Lys 
1 5 



54 



ggc act gcc cat ggt aaa att att tta ate ggt gag cat get gtt gtc 



102 
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Gly Thr Ala His Gly Lys He He Leu He Gly Glu His Ala Val Val 
10 15 20 25 

tat aac atg ccg gcc ate gec etc cct ttt acc aca gec acc ate acc 
Tyr Asn Met Pro Ala He Ala Leu Pro Phe Thr Thr Ala Thr He Thr 

30 35 40 

gtt gaa gtt agt cct tac caa ggc aaa age tat eta gaa agt get tgc 
Val Glu Val Ser Pro Tyr Gin Gly Lys Ser Tyr Leu Glu Ser Ala Cys 

45 50 55 

tac tgc gga tct tta gac caa gcg ccc ggg gac ttg gca ggg ctt caa 
Tyr Cys Gly Ser Leu Asp Gin Ala Pro Gly Asp Leu Ala Gly Leu Gin 
60 65 70 

gcc tgt ttg aca gcg gtt tgt gcc gac tta gac cag tec age gac cac 
Ala Cys Leu Thr Ala Val Cys Ala Asp Leu Asp Gin Ser Ser Asp His 
75 80 85 

ttg tat ate aag gtc gac age atg ate cct get gaa aga gga atg ggg 
Leu Tyr He Lys Val Asp Ser Met He Pro Ala Glu Arg Gly Met Gly 
90 95 100 105 

tec agt get get gtg gcc acc gcc tta gtc aag gcc etc ttt cac tac 
Ser Ser Ala Ala Val Ala Thr Ala Leu Val Lys Ala Leu Phe His Tyr 

110 115 120 

ttc caa gtc gac tta age agt gaa gcc etc tea gcc tat gtc gag att 
Phe Gin Val Asp Leu Ser Ser Glu Ala Leu Ser Ala Tyr Val Glu He 

125 130 135 

gcc gaa aaa att acc cat ggc aag cca teg ggt ctg gat get aca gtc 
Ala Glu Lys He Thr His Gly Lys Pro Ser Gly Leu Asp Ala Thr Val 
140 145 150 

gtc aac tec att gcc ccc gtt tat ttt aaa cgc aac cag ctt ccc aag 
Val Asn Ser He Ala Pro Val Tyr Phe Lys Arg Asn Gin Leu Pro Lys 
155 160 165 

gcc ate cct tta aat gtt gac ggc tat tta att gca gcc gat act ggg 
Ala He Pro Leu Asn Val Asp Gly Tyr Leu He Ala Ala Asp Thr Gly 
170 175 180 185 

att aag ggc cac acg aaa gaa gcc gtt ggg gat gtg gcg aag ctg gtt 
He Lys Gly His Thr Lys Glu Ala Val Gly Asp Val Ala Lys Leu Val 

190 195 200 

gaa act gcc aag gtt caa acc atg gac att gtc cac cac etc ggc cag 
Glu Thr Ala Lys Val Gin Thr Met Asp He Val His His Leu Gly Gin 

205 210 215 

ctt acc cac cag get aaa aaa gca ate atg acc aat aac etc cct ggc 
Leu Thr His Gin Ala Lys Lys Ala He Met Thr Asn Asn Leu Pro Gly 
220 225 230 

tta ggg gag att ttg aac cag tec cac caa etc tta aag gat tta act 
Leu Gly Glu He Leu Asn Gin Ser His Gin Leu Leu Lys Asp Leu Thr 



150 



198 



246 



294 



342 



390 



438 



486 



534 



582 



630 



678 



726 



774 
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235 240 245 

gtc age aat ccc aag tta gac caa ctt gtc caa gca gec caa gat get 
Val Ser Asn Pro Lys Leu Asp Gin Leu Val Gin Ala Ala Gin Asp Ala 
250 255 260 265 

gga get tgc gga get aag tta ace ggt ggg ggc egg ggt ggt tgc atg 
Gly Ala Cys Gly Ala Lys Leu Thr Gly Gly Gly Arg Gly Gly Cys Met 

270 275 280 

att gee eta gee caa age aac cag gat gee tec aat att gec caa aaa 
He Ala Leu Ala Gin Ser Asn Gin Asp Ala Ser Asn He Ala Gin Lys 

285 290 295 

ttg gaa aaa gcg gga gee att gaa ace tgg ate cac ccc tta gga gaa 
Leu Glu Lys Ala Gly Ala He Glu Thr Trp He His Pro Leu Gly Glu 
300 305 310 

gee aac cat gac taa 
Ala Asn His Asp 
315 



<210> 46 
<211> 317 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 46 

Met Thr Leu Val Lys Asn Val Ala Lys Gly Thr Ala His Gly Lys lie 
15 10 15 



He Leu He Gly Glu His Ala Val Val Tyr Asn Met Pro Ala He Ala 

20 25 - 30 



Leu Pro Phe Thr Thr Ala Thr He Thr Val Glu Val Ser Pro Tyr Gin 
35 40 45 



Gly Lys Ser Tyr Leu Glu Ser Ala Cys Tyr Cys Gly Ser Leu Asp Gin 
50 55 60 



Ala Pro Gly Asp Leu Ala Gly Leu Gin Ala Cys Leu Thr Ala Val Cys 
65 70 75 80 



Ala Asp Leu Asp Gin Ser Ser Asp His Leu Tyr He Lys Val Asp Ser 

85 90 95 



822 



870 



918 



966 



981 



Met He Pro Ala Glu Arg Gly Met Gly Ser Ser Ala Ala Val Ala Thr 

100 105 110 
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Ala Leu Val Lys Ala Leu Phe His Tyr Phe Gin Val Asp Leu Ser Ser 
115 120 125 



Glu Ala Leu Ser Ala Tyr Val Glu lie Ala Glu Lys lie Thr His Gly 
130 135 140 



Lys Pro Ser Gly Leu Asp Ala Thr Val Val Asn Ser He Ala Pro Val 
145 150 155 160 



Tyr Phe Lys Arg Asn Gin Leu Pro Lys Ala He Pro Leu Asn Val Asp 

165 170 175 



Gly Tyr Leu He Ala Ala Asp Thr Gly lie Lys Gly His Thr Lys Glu 

180 185 190 



Ala Val Gly Asp Val Ala Lys Leu Val Glu Thr Ala Lys Val Gin Thr 
195 200 205 



Met Asp He Val His His Leu Gly Gin Leu Thr His Gin Ala Lys Lys 
210 215 220 



Ala He Met Thr Asn Asn Leu Pro Gly Leu Gly Glu He Leu Asn Gin 
225 230 235 240 



Ser His Gin Leu Leu Lys Asp Leu Thr Val Ser Asn Pro Lys Leu Asp 

245 250 255 



Gin Leu Val Gin Ala Ala Gin Asp Ala Gly Ala Cys Gly Ala Lys Leu 

260 265 270 



Thr Gly Gly Gly Arg Gly Gly- Cys Met lie Ala Leu Ala Gin Ser Asn 
275 280 285 



Gin Asp Ala Ser Asn He Ala Gin Lys Leu Glu Lys Ala Gly Ala He 
290 295 300 



Glu Thr Trp He His Pro Leu Gly Glu Ala Asn His Asp 
305 310 315 



<210> 47 
<211> 975 
<212> DNA 

<213> Alloiococcus otitidis 
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<220> 

<221> CDS 

<222> (46) . . (975) 

<223> 



<400> 47 

agaatcaaat ttgttttaaa attatcagct tttggaggtc agaac atg aac aat tec 

Met Asn Asn Ser 
1 



cgt att ttt tta gtc tat gac cgt aaa gac tgg cag tct ctt aga gaa 105 
Arg lie Phe Leu Val Tyx Asp Arg Lys Asp Trp Gin Ser Leu Arg Glu 
5 10 15 20 

aat gec age ctt tct tta acg gaa aaa aac eta aat aac ttg cgt gca 153 
Asn Ala Ser Leu Ser Leu Thr Glu Lys Asn Leu Asn Asn Leu Arg Ala 

25 30 35 



gtg aat gac gtc ata teg atg gaa gat gtc cga gaa gtt tac gtc ccc 
Val Asn Asp Val lie Ser Met Glu Asp Val Arg Glu Val Tyr Val Pro 

40 45 50 



201 



att ate caa tta ctg gat gtc tac ata aaa agt tac tac cgc cac cag 249 
lie lie Gin Leu Leu Asp Val Tyr lie Lys Ser Tyr Tyr Arg His Gin 
55 60 65 



get tec ttg ate aat tac ttg aac ctg 
Ala Ser Leu lie Asn Tyr Leu Asn Leu 
70 75 

ccc tat gtg att ggg att gca ggg age 
Pro Tyr Val lie Gly He Ala Gly Ser 
85 90 

gtt gec agg ctt ctt aag tec etc ttg 

Val Ala Arg Leu Leu Lys Ser Leu Leu 

105 

aag gta gac etc etc aca aca gat ggc 

Lys Val Asp Leu Leu Thr Thr Asp Gly 

120 125 



gac cag cct aaa aag tac caa 297 
Asp Gin Pro Lys Lys Tyr Gin 
80 

gtg get gtg ggc aag tct acg 345 
Val Ala Val Gly Lys Ser Thr 
95 100 

age gac tac tat ccg gaa aaa 3 93 

Ser Asp Tyr Tyr Pro Glu Lys 
110 115 

ttc ctt tat ccg aat aag att 441 
Phe Leu Tyr Pro Asn Lys He 

130 



tta aaa gag cga gat ate atg gac cgc aag ggt ttt ccc gaa age tat 489 
Leu Lys Glu Arg Asp He Met Asp Arg Lys Gly Phe Pro Glu Ser Tyr 
135 140 145* 



gat atg aaa cgt ttg att aac ttt atg ace gat gtc aaa aat aat gtt 537 
Asp Met Lys Arg Leu He Asn Phe Met Thr Asp Val Lys Asn Asn Val 
150 155 160 

ccc aac ate cag gtg ccc aag tat tec cac caa gtt tac gac ata gta 585 
Pro Asn He Gin Val Pro Lys Tyr Ser His Gin Val Tyr Asp He Val 
165 170 175 180 



gaa ggg gaa agg ttg acc att aac cag cca gac ate ttg att gtc gaa 
Glu Gly Glu Arg Leu Thr lie Asn Gin Pro Asp He Leu He Val Glu 

185 190 195 



633 
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ggg ate aat gtg etc caa ctt cct tct aat gag aag att ttt gtt age 681 

Gly lie Asn Val Leu Gin Leu Pro Ser Asn Glu Lys lie Phe Val Ser 

200 205 210 

gat ttt ttc gac ttc tec ttt tat gtg gat gee tea gaa aat ctg att 729 

Asp Phe Phe Asp Phe Ser Phe Tyr Val Asp Ala Ser Glu Asn Leu lie 

215 220 225 

gaa aaa tgg tac atg caa cgc ttt ggc acc ttt atg gat acc gec ttc 777 

Glu Lys Trp Tyr Met Gin Arg Phe Gly Thr Phe Met Asp Thr Ala Phe 
230 235 240 

caa gac ccc aac aac tat tac tac aag ttt aat gac tgg gac cgc aag 825 

Gin Asp Pro Asn Asn Tyr Tyr Tyr Lys Phe Asn Asp Trp Asp Arg Lys 
245 250 255 260 

gaa get ttt gec tat gee aac caa gtt tgg gaa acg gtt aac eta gaa 873 
Glu Ala Phe Ala Tyr Ala Asn Gin Val Trp Glu Thr Val Asn Leu Glu 

265 270 275 

aac etc agg gaa tat att eta ccc acc cga etc egg get aac etc ate 921 

Asn Leu Arg Glu Tyr lie Leu Pro Thr Arg Leu Arg Ala Asn Leu lie 

280 285 290 

etc cat aaa acc cat aac cac tac ate gac aag att tta etc aaa aaa 969 

Leu His Lys Thr His Asn His Tyr lie Asp Lys lie Leu Leu Lys Lys 

295 300 305 

cac tga 975 
His 



<210> 48 
<211> 309 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 48 

Met Asn Asn Ser Arg lie Phe Leu Val Tyr Asp Arg Lys Asp Trp Gin 
15 10 15 



Ser Leu Arg Glu Asn Ala Ser Leu Ser Leu Thr Glu Lys Asn Leu Asn 

20 25 30 



Asn Leu Arg Ala Val Asn Asp Val lie Ser Met Glu Asp Val Arg Glu 
35 40 45 



Val Tyr- Val Pro lie lie Gin Leu Leu Asp Val Tyr He Lys Ser Tyr 
50 55 60 



Tyr Arg His Gin Ala Ser Leu He Asn Tyr Leu Asn Leu Asp Gin Pro 
65 70 75 80 
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Lys Lys Tyr Gin Pro Tyr Val lie Gly lie Ala Gly Ser Val Ala Val 

85 90 95 



Gly Lys Ser Thr Val Ala Arg Leu Leu Lys Ser Leu Leu Ser Asp Tyr. 

100 105 HO 

Tyr Pro Glu Lys Lys Val Asp Leu Leu Thr Thr Asp Gly Phe Leu Tyr 
115 120 125 

Pro Asn Lys He Leu Lys Glu Arg Asp He Met Asp Arg Lys Gly Phe 
130 135 140 

Pro Glu Ser Tyr Asp Met Lys Arg Leu He Asn Phe Met Thr Asp Val 
X45 150 155 160 

Lys Asn Asn Val Pro Asn He Gin Val Pro Lys Tyr Ser His Gin Val 

165 170 175 

Tyr Asp He Val Glu Gly Glu Arg Leu Thr He Asn Gin Pro Asp lie 

180 185 190 

Leu He Val Glu Gly He Asn Val Leu Gin Leu Pro Ser Asn Glu Lys 
195 200 205 

He Phe Val Ser Asp Phe Phe Asp Phe Ser Phe Tyr Val Asp Ala Ser 
210 215 220 

Glu Asn Leu He Glu Lys Trp Tyr Met Gin Arg Phe Gly Thr Phe Met 
225 230 235 240 

Asp Thr Ala Phe Gin Asp Pro Asn Asn Tyr Tyr Tyr Lys Phe Asn Asp 

245 250 255 

Trp Asp Arg Lys Glu Ala Phe Ala Tyr Ala Asn Gin Val Trp Glu Thr 

260 265 270 



Val Asn Leu Glu Asn Leu Arg Glu Tyr He Leu Pro Thr Arg Leu Arg 
275 280 285 



Ala Asn Leu He Leu His Lys Thr His Asn His Tyr He Asp Lys He 
290 295 300 
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Leu Leu Lys Lys His 
305 



<210> 49 
<211> 846 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (7) . . (846) 

<223> 

<400> 49 

agctta atg gga gac gat tta aga gaa gaa att ctt gac cga atg aag 
Met Gly Asp Asp Leu Arg Glu Glu lie Leu Asp Arg Met Lys 
15 10 

. gtc caa gcc caa att aac ccc aat gag gaa att cgc egg acc att gac 
Val Gin Ala Gin He Asn Pro Asn Glu Glu He Arg Arg Thr He Asp 
15 20 25 30 



att ttt att ggt ate cgc eta cct tat ggg gat caa ttt gat gaa gca 
He Phe He Gly He Arg Leu Pro Tyr Gly Asp Gin Phe Asp Glu Ala 
80 85 90 



caa ggc att gaa gtt tct gac ttt aac aag ggc aat ate aaa get egg 
Gin Gly He Glu Val Ser Asp Phe Asn Lys Gly Asn He Lys Ala Arg 

130 135 140 

ate cga atg gtg gcc caa tat ggc gta gcg ggt cac ttc cac ggg gcg 
He Arg Met Val Ala Gin Tyr Gly Val Ala Gly His Phe His Gly Ala 
145 * 150 155 



48 



96 



ttt ate aag gac tat etc cag gcc cac ccc ttc ttt gaa tec tta ate 144 
Phe He Lys Asp Tyr Leu Gin Ala His Pro Phe Phe Glu Ser Leu He 

35 40 45 

ttg ggc ate tec ggt ggc cag gat tec acc etc ctg ggt aag eta gcc 192 
Leu Gly He Ser Gly Gly Gin Asp Ser Thr Leu Leu Gly Lys Leu Ala 

50 55 60 

cag atg gcc tgc ctt gaa ctg agg gaa gag gag ggg tct gac aag cca 240 
Gin Met Ala Cys Leu Glu Leu Arg Glu Glu Glu Gly Ser Asp Lys Pro 
65 70 75 



288 



gaa gcc cag caa gcc etc aat tgg ate cag cct gac cag get ctg acc 33 6 

Glu Ala Gin Gin Ala Leu Asn Trp He Gin Pro Asp Gin Ala Leu Thr 
95 100 105 110 

att aat ate aaa gag tec gtt gat ggc ctg gtt gac act ttg gcc ggc 384 
He Asn He Lys Glu Ser Val Asp Gly Leu Val Asp Thr Leu Ala Gly 

115 120 125 



432 



480 



gtg tta gga tct gac cat tea gcc gaa aat gta act ggc ttt ttc acc 



528 
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Val Leu Gly Ser Asp His Ser Ala Glu Asn Val Thr Gly Phe Phe Thr 
160 165 170 

aag cat ggg gac ggc get agt gac etc aac cct ctt ttc cgc eta aat 57 6 

Lys His Gly Asp Gly Ala Ser Asp Leu Asn Pro Leu Phe Arg Leu Asn 
175 180 185 190 

aaa cgt cag gga egg gee ctg ctt gag gaa tta ggg tec cct aag aac 624 
Lys Arg Gin Gly Arg Ala Leu Leu Glu Glu Leu Gly Ser Pro Lys Asn 

195 200 205 

ttg tac caa aag acc ccc aca get gat ttg gaa gaa gac cag ccc ggc 672 
Leu Tyr Gin Lys Thr Pro Thr Ala Asp Leu Glu Glu Asp Gin Pro Gly 

210 215 220 

ttg tea gat gaa gac aag tta ggg gtt tct tat gaa gee att gat gac 720 
Leu Ser Asp Glu Asp Lys Leu Gly Val Ser Tyr Glu Ala lie Asp Asp 
225 230 235 

tac ttg gag ggc aag cca gtt age cag gag gac cag gca acc ate gaa 7 68 

Tyr Leu Glu Gly Lys Pro Val Ser Gin Glu Asp Gin Ala Thr lie Glu 
240 245 250 

aaa tgg tat caa caa acg gee cac aag cgc cac ttg ccg gtg act ate 816 
Lys Trp Tyr Gin Gin Thr Ala His Lys Arg His Leu Pro Val Thr lie 
255 260 265 270 

ttt gat gat ttt tgg aaa gaa aaa aat tag 846 
Phe Asp Asp Phe Trp Lys Glu Lys Asn 

275 



<210> 50 
<211> 279 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 50 

Met Gly Asp Asp Leu Arg Glu Glu lie Leu Asp Arg Met Lys Val Gin 
15 10 15 



Ala Gin lie Asn Pro Asn Glu Glu lie Arg Arg Thr lie Asp Phe lie 

20 25 30 



Lys Asp Tyr Leu Gin Ala His Pro Phe Phe Glu Ser Leu lie Leu Gly 
35 40 45 



lie Ser Gly Gly Gin Asp Ser Thr Leu Leu Gly Lys Leu Ala Gin Met 
50 55 60 



Ala Cys Leu Glu Leu Arg Glu Glu Glu Gly Ser Asp Lys Pro lie Phe 
65 70 75 80 
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lie Gly lie Arg Leu Pro Tyr Gly Asp Gin Phe Asp Glu Ala Glu Ala 

85 90 95 



Gin Gin Ala Leu Asn Trp lie Gin Pro Asp Gin Ala Leu Thr lie Asn 

100 105 110 



lie Lys Glu Ser Val Asp Gly Leu Val Asp Thx Leu Ala Gly Gin Gly 
115 120 125 



lie Glu Val Ser Asp Phe Asn Lys Gly Asn Tie Lys Ala Arg lie Arg 
130 135 140 



Met Val Ala Gin Tyr Gly Val Ala Gly His Phe His Gly Ala Val Leu 
145 150 155 160 



Gly Ser Asp His Ser Ala Glu Asn Val Thr Gly Phe Phe Thr Lys His 

165 170 175 



Gly Asp Gly Ala Ser Asp Leu Asn Pro Leu Phe Arg Leu Asn Lys Arg 

180 185 190 



Gin Gly Arg Ala Leu Leu Glu Glu Leu Gly Ser Pro Lys Asn Leu Tyr 
195 200 205 



Gin Lys Thr Pro Thr Ala Asp Leu Glu Glu Asp Gin Pro Gly Leu Ser 
210 215 220 



Asp Glu Asp Lys Leu Gly Val Ser Tyr Glu Ala lie Asp Asp Tyr Leu 
225 230 235 240 



Glu Gly Lys Pro Val Ser Gin Glu Asp Gin Ala Thr lie Glu Lys Trp 

245 250 255 



Tyr Gin Gin Thr Ala His Lys Arg His Leu Pro Val Thr lie Phe Asp 

260 265 270 



Asp Phe Trp Lys Glu Lys Asn 
275 



<210> 51 
<211> 843 
<212> DNA 

<213> Alloiococcus otitidis 



WO 03/104391 
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<220> 

<221> CDS 

<222> (7) . . (843) 

<223> 

<400> 51 

aggaac atg att atg tat act gat ggg att ggc ttt att gat tea gga 48 
Met lie Met Tyr Thr Asp Gly He Gly Phe He Asp Ser Gly 
15 10 

gtg ggt ggc ttc acc ctg gtc aaa gaa gec atg aag caa ttg cca aat 96 
Val Gly Gly Phe Thr Leu Val Lys Glu Ala Met Lys Gin Leu Pro Asn 
15 20 25 30 

gaa caa ttt tac tat ctg gga gac acc gec egg tea cct tat gga cct 144 
Glu Gin Phe Tyr Tyr Leu Gly Asp Thr Ala Arg Ser Pro Tyr Gly Pro 

35 40 45 

aaa gac atg gec act gtc aag gca tat gec ttt gaa ctt gec aat tac 192 
-Lys Asp Met Ala Thr Val Lys Ala Tyr Ala Phe Glu Leu Ala Asn Tyr 

50 55 60 

ctg gtt aaa aac cac cag ate aaa ate ttg gtg ate get tgt aat act 240 
Leu Val Lys Asn His Gin He Lys He Leu Val He Ala Cys Asn Thr 
65 70 75 

gcg act gtc get gec etc aag gac eta aaa cag gec ttg ccc ate cca 288 
Ala Thr Val Ala Ala Leu Lys Asp Leu Lys Gin Ala Leu Pro lie Pro 
80 85 90 

gtt tta ggg gtc ate tta cct ggt tgc cga gca get att aag get agt 33 6 

Val Leu Gly Val He Leu Pro Gly Cys Arg Ala Ala He Lys Ala Ser 
95 100 105 110 

gtt aac cat cag att ggg gtt att gee acc cat ggg acc ate cag tec 3 84 

Val Asn His Gin He Gly Val He Ala Thr His Gly Thr He Gin Ser 

115 120 125 

ggt cgc tat gag ctt gaa ctt aaa egg aaa cga ccg gat att gaa gtg 432 
Gly Arg Tyr Glu Leu Glu Leu Lys Arg Lys Arg Pro Asp He Glu Val 

130 135 140 

aca agt ctg get tgt ccc gaa ttt gec ccc atg gta gag gcg gga gac 480 
Thr Ser Leu Ala Cys Pro Glu Phe Ala Pro Met Val Glu Ala Gly Asp 
145 150 155 

tac cga tct gtt caa get age agt gtg gtg agg aca tec tta cag gee 528 
Tyr Arg Ser Val Gin Ala Ser Ser Val Val Arg Thr Ser Leu Gin Ala 
160 165 . 170 

eta gaa gac caa gat ttg gat acc ctt att ttg ggt tgc acc cac tat 576 
Leu Glu Asp Gin Asp Leu Asp Thr Leu He Leu Gly Cys Thr His Tyr 
175 180 185 190 

ccc att ata aaa gac etc att caa gac tct att ggc cct ggt ate age 624 
Pro He He Lys Asp Leu He Gin Asp Ser He Gly Pro Gly He Ser 
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195 200 205 

ttg gtt gat cca ggg gcg gaa get gtg aat gac ttg agt gtc tta tta 

Leu Val Asp Pro Gly Ala Glu Ala Val Asn Asp Leu Ser Val Leu Leu 

210 215 220 

gac tat tat gac ttg act aat gac egg ttt aat ccc aac ctg acc cac 

Asp Tyr Tyr Asp Leu Thr Asn Asp Arg Phe Asn Pro Asn Leu Thr His 
225 230 235 



gag ttg.caa gaa gtt aat gga aga taa 
Glu Leu Gin Glu Val Asn Gly Arg 

275 



<210> 52 
<211> 278 
<212> PRT 

<213> Alloiococcus otitidis 



<400> 52 

Met He Met Tyr Thr Asp Gly He Gly Phe He Asp Ser Gly Val Gly 
15 10 15 



Gly Phe Thr Leu Val Lys Glu Ala Met Lys Gin Leu Pro Asn Glu Gin 

20 25 30 



Phe Tyr Tyr Leu Gly Asp Thr Ala Arg Ser Pro Tyr Gly Pro Lys Asp 
35 40 45 



Met Ala Thr Val Lys Ala Tyr Ala Phe Glu Leu Ala Asn Tyr Leu Val 
50 55 60 



Lys Asn His Gin lie Lys He Leu Val He Ala Cys Asn Thr Ala Thr 
65 70 75 80 



Val Ala Ala Leu Lys Asp Leu Lys Gin Ala Leu Pro He Pro Val Leu 

85 90 95 



672 



720 



cat ttt tac acc acg gga gat aaa gec ggg ttt aag aaa ate gcg gat 768 
His Phe Tyr Thr Thr Gly Asp Lys Ala Gly Phe Lys Lys He Ala Asp 
240 245 250 

gac tgg ctt gac cac cac aac tac egg gtt gac cat tta gat tta gag 816 
Asp Trp Leu Asp His His Asn Tyr Arg Val Asp His Leu Asp Leu Glu 
255 260 265 270 



843 



Gly Val He Leu Pro Gly Cys Arg Ala Ala He Lys Ala Ser Val Asn 

100 105 110 
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His Gin He Gly Val He Ala Thr His Gly Thr He Gin Ser Gly Arg 
115 120 125 



Tyr Glu Leu Glu Leu Lys Arg Lys Arg Pro Asp He Glu Val Thr Ser 
130 135 140 



Leu Ala Cys Pro Glu Phe Ala Pro Met Val Glu Ala Gly Asp Tyr Arg 
145 150 155 160 



Ser Val Gin Ala Ser Ser Val Val Arg Thr Ser Leu Gin Ala Leu Glu 

165 170 175 



Asp Gin Asp Leu Asp Thr Leu He Leu Gly Cys Thr His Tyr Pro He 

180 185 190 



He Lys Asp Leu He Gin Asp Ser He Gly Pro Gly He Ser Leu Val 
195 200 205 



Asp Pro Gly Ala Glu Ala Val Asn Asp Leu Ser Val Leu Leu Asp Tyr 
210 215 220 



Tyr Asp Leu Thr Asn Asp Arg Phe Asn Pro Asn Leu Thr His His Phe 
225 230 235 240 



Tyr Thr Thr Gly Asp Lys Ala Gly Phe Lys Lys He Ala Asp Asp Trp 

245 250 255 



Leu Asp His His Asn Tyr Arg Val Asp His Leu Asp Leu Glu Glu Leu 

260 265 270 



Gin Glu Val Asn Gly Arg 
275 



<210> 53 
<211> 957 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (7) . . (957) 

<223> 

<400> 53 

aaaaat atg acg aag gag tct tea ttt atg gtc aag acc aaa ata tgt 48 
Met Thr Lys Glu Ser Ser Phe Met Val Lys Thr Lys He Cys 
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15 10 

tct att tta aat ata aca ccg gat tea ttt tct gat ggt ggg cgc aac 96 
Ser lie Leu Asn lie Thr Pro Asp Ser Phe Ser Asp Gly Gly Arg Asn 
15 20 25 30 

tat cag gca gac caa gec ata get cac gga etc gac ttg gta gac aag 144 
Tyr Gin Ala Asp Gin Ala lie Ala His Gly Leu Asp Leu Val Asp Lys 

35 40 45 

gga gcg gac atg ttg gat att gga ggt gag teg acc egg cct ggt tec 192 
Gly Ala Asp Met Leu Asp lie Gly Gly Glu Ser Thr Arg Pro Gly Ser 

50 55 60 

agt cca gtc gac etc caa gat gaa ate gac cgt att gta ccg gtg ate 240 
Ser Pro Val Asp Leu Gin Asp Glu He Asp Arg He Val Pro Val He 
65 70 75 

aag gga ate aga gaa aaa agt cag gtt cct att tea gta gat acc tac 288 
Lys Gly He Arg Glu Lys Ser Gin Val Pro He Ser Val Asp Thr Tyr 
80 85 90 

egg get cca gtt gee aaa gcg get att gat get ggg gcg gat ate ate 33 6 

Arg Ala Pro Val Ala Lys Ala Ala He Asp Ala Gly Ala Asp He He 
95 100 105 110 

aat gat att acc ggt eta act ggt gat gta gac atg gec gac ttg eta 384 
Asn Asp He Thr Gly Leu Thr Gly Asp Val Asp Met Ala Asp Leu Leu 

115 120 125 

get caa gaa ggg gtt aag gec att gtc atg ttc aac ccg gtt att get 432 
Ala Gin Glu Gly Val Lys Ala He Val Met Phe Asn Pro Val He Ala 

130 135 140 

cga cct gac cac cca tct tec caa aaa ttc aga gat ttc ggg ggc cga 480 
Arg Pro Asp His Pro Ser Ser Gin Lys Phe Arg Asp Phe Gly Gly Arg 
145 150 155 • 

gat ttt ttc acc gat gaa gaa aga gat aaa atg tec caa gca ccc att 528 
Asp Phe Phe Thr Asp Glu Glu Arg Asp Lys Met Ser Gin Ala Pro He 
160 165 170 

gaa gag gee atg atg gtc tac ttt gac aaa gtc ttg aac aag gee cat 576 
Glu Glu Ala Met Met Val Tyr Phe Asp Lys Val Leu Asn Lys Ala His 
175 180 185 190 

caa get ggg att gac egg gat aag att tta ctg gac ccg gga att ggc 624 
Gin Ala Gly He Asp Arg Asp Lys He Leu Leu Asp Pro Gly He Gly 

195 200 205 

ttt ggc ctg acc aag aag gaa aat tac aag ttg att cac agt gtt gee 672 
Phe Gly Leu Thr Lys Lys Glu Asn Tyr Lys Leu He His Ser Val Ala 

210 215 220 

teg att cat gac aag ggc tac ccg gtc ttt tta gga gtt tec cgc aaa 720 
Ser He His Asp Lys Gly Tyr Pro Val Phe Leu Gly Val Ser Arg Lys 
225 230 235 



WO 03/104391 



111/235 



PCT/US02/36122 



cgc ttc ttg gtg ggg gaa gtc tec aag eta ggc ate gaa gee gac cca 7 68 

Arg Phe Leu Val Gly Glu Val Ser Lys Leu Gly lie Glu Ala Asp Pro 
240 245 250 

gag acc caa gca gga ttt tta aac cga gac ctg get tea get att att 816 
Glu Thr Gin Ala Gly Phe Leu Asn Arg Asp Leu Ala Ser Ala lie lie 
255 260 265 270 

aca get tac get age cat ata ggg gta gac tat gtc egg gtt cat tec 864 
Thr Ala Tyr Ala Ser His lie Gly Val Asp Tyr Val Arg Val His Ser 

275 280 285 

tta gat gaa cac aaa ata gca acc acc att acc cat aat att tta aac 912 
Leu Asp Glu His Lys lie Ala Thr Thr lie Thr His Asn lie Leu Asn 

290 295 300 

age gat age tta gat gat cag age ttt gac caa tat aaa aat taa 957 
Ser Asp Ser Leu Asp Asp Gin Ser Phe Asp Gin Tyr Lys Asn 
305 310 315 



<210> 54 
<211> 316 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 54 

Met Thr Lys Glu Ser Ser Phe Met Val Lys Thr Lys lie Cys Ser lie 
15 10 15 



Leu Asn lie Thr Pro Asp Ser Phe Ser Asp Gly Gly Arg Asn Tyr Gin 

20 25 30 



Ala Asp Gin Ala lie Ala His Gly Leu Asp Leu Val Asp Lys Gly Ala 
35 40 45 



Asp Met Leu Asp lie Gly Gly Glu Ser Thr Arg Pro Gly Ser Ser Pro 
50 55 60 



Val Asp Leu Gin Asp Glu lie Asp Arg lie Val Pro Val lie Lys Gly 
65 70 75 • 80 



He Arg Glu Lys Ser Gin Val Pro He Ser Val Asp Thr Tyr Arg Ala 

85 90 95 



Pro Val Ala Lys Ala Ala He Asp Ala Gly Ala Asp He He Asn Asp 

100 105 110 



He Thr Gly Leu Thr Gly Asp Val Asp Met Ala Asp Leu Leu Ala Gin 
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115 120 125 



Glu Gly Val Lys Ala lie Val Met Phe Asn Pro Val lie Ala Arg Pro 
130 135 140 



Asp His Pro Ser Ser Gin Lys Phe Arg Asp Phe Gly Gly Arg Asp Phe 
145 150 155 160 



Phe Thr Asp Glu Glu Arg Asp Lys Met Ser Gin Ala Pro lie Glu Glu 

165 170 175 



Ala Met Met Val Tyr Phe Asp Lys Val Leu Asn Lys Ala His Gin Ala 

180 185 190 



Gly He Asp Arg Asp Lys He Leu Leu Asp Pro Gly He Gly Phe Gly 
195 200 205 



Leu Thr Lys Lys Glu Asn Tyr Lys Leu He His Ser Val Ala Ser He 
210 215 220 



His Asp Lys Gly Tyr Pro Val Phe Leu Gly Val Ser Arg Lys Arg Phe 
225 230 235 240 



Leu Val Gly Glu Val Ser Lys Leu Gly He Glu Ala Asp Pro Glu Thr 

245 250 255 



Gin Ala Gly Phe Leu Asn Arg Asp Leu Ala Ser Ala He He Thr Ala 

260 265 270 



Tyr Ala Ser His lie Gly Val Asp Tyr Val Arg Val His Ser Leu Asp 
275 280 285 



Glu His Lys He Ala Thr Thr He Thr His Asn He Leu Asn Ser Asp 
290 295 300 



Ser Leu Asp Asp Gin Ser Phe Asp Gin Tyr Lys Asn 
305 310 315 



<210> 55 
<211> 561 
<212> DNA 

<213> Alloiococcus otitidis 



<220> 
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<221> CDS 

<222> (28) . . (561) 

<223> 

<400> 55 

acaagaacta gacgaaaaat tagaacg ttg gga ata ttt aag cca ata tgt ata 54 

Met Gly lie Phe Lys Pro lie Cys He 
1 5 

gga gag ata act atg ata gcc tac gtt tgg gcc caa gat gag caa gga 102 
Gly Glu He Thr Met He Ala Tyr Val Trp Ala Gin Asp Glu Gin Gly 
10 15 20 25 

ate att ggt aaa gac aag gtt ttg cct tgg gaa ttg tec aat gac tta 150 
He He Gly Lys Asp Lys Val Leu Pro Trp Glu Leu Ser Asn Asp Leu 

30 35 40 

aag cat ttt aaa aaa gtt aca gaa ggt cac acc ate ctg atg ggc egg 198 
Lys His Phe Lys Lys Val Thr Glu Gly His Thr Tie Leu Met Gly Arg 

45 50 55 

aag acc ttt gaa gga atg gat aaa aag ccc etc cct aac cga aaa acc 246 
Lys Thr Phe Glu Gly Met Asp Lys Lys Pro Leu Pro Asn Arg Lys Thr 
60 65 70 

ttg gta ttg acc cgc caa gat gac tac caa get ggg gac gac cag gtt 294 
Leu Val Leu Thr Arg Gin Asp Asp Tyr Gin Ala Gly Asp Asp Gin Val 
75 80 85 

gaa gtc gtc cac tec aaa gac cag gcc ttg act tat gcg tea ggt cat 342 
Glu Val Val His Ser Lys Asp Gin Ala Leu Thr Tyr Ala Ser Gly His 
90 95 100 105 

ggg gtg gac etc tat gtg att ggt ggg gcc ggc att ttc gac ttg ttt 390 
Gly Val Asp Leu Tyr Val He Gly Gly Ala Gly He Phe Asp Leu Phe 

110 115 120 

ctg gac caa gtt gat gtt etc cac caa aca gtt ate cac gag age ttt 438 
Leu Asp Gin Val Asp Val Leu His Gin Thr Val He His Glu Ser Phe 

125 130 135 

gat ggt gac acc acc atg cca gac att gac tgg gac age ttt aat cag 486 
Asp Gly Asp Thr Thr Met Pro Asp He Asp Trp Asp Ser Phe Asn Gin 
140 145 150 

gtg tct aaa get tat tat gac cag get gac ggt cac aac cac tec cac 534 
Val Ser Lys Ala Tyr Tyr Asp Gin Ala Asp Gly His Asn His Ser His 
155 160 165 

acc att tat gaa tac aga aga aaa taa 561 
Thr He Tyr Glu Tyr Arg Arg Lys 
170 175 



<210> 56 
<211> 177 
<212* PRT 
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<213> Alloiococcus otitidis 
<400> 56 

Met Gly lie Phe Lys Pro lie Cys lie Gly Glu He Thr Met He Ala 
15 10 15 

Tyr Val Trp Ala Gin Asp Glu Gin Gly lie He Gly Lys Asp Lys Val 

20 25 30 



Leu Pro Trp Glu Leu Ser Asn Asp Leu Lys His Phe Lys Lys Val Thr 
35 40 45 



Glu Gly His Thr He Leu Met Gly Arg Lys Thr Phe Glu Gly Met Asp 
50 55 60 



Lys Lys Pro Leu Pro Asn Arg Lys Thr Leu Val Leu Thr Arg Gin Asp 
65 70 75 80 

Asp Tyr Gin Ala Gly Asp Asp Gin Val Glu Val Val His Ser Lys Asp 

85 90 95 



Gin Ala Leu Thr Tyr Ala Ser Gly His Gly Val Asp Leu Tyr Val He 

100 105 HO 



Gly Gly Ala Gly He Phe Asp Leu Phe Leu Asp Gin Val Asp Val Leu 
115 120 125 



His Gin Thr Val He His Glu Ser Phe Asp Gly Asp Thr Thr Met Pro 
130 135 140 



Asp He Asp Trp Asp Ser Phe Asn Gin Val Ser Lys Ala Tyr Tyr Asp 
145 150 155 160 



Gin Ala Asp Gly His Asn His Ser His Thr He Tyr Glu Tyr Arg Arg 

165 170 175 



Lys 



<210> 57 
<211> 1968 
<212> DNA 

<213> Alloiococcus otitidis 



<220> 
<221> CDS 
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<222> (7) . . (1968) 
<223> 



<400> 57 

agagat ttg acg aag gaa tct tac aat gat tea tct ata acc ata etc 
Met Thr Lys Glu Ser Tyr Asn Asp Ser Ser lie Thr lie Leu 
1 5 10 

aag ggc tta gac gec gtt aag aaa aga cca ggc atg tat ate ggg tea 
Lvs Gly Leu Asp Ala Val Lys Lys Arg Pro Gly Met Tyr lie Gly Ser 
!5 20 25 30 

acc gat gee agg ggt ttg cac cac ctg gtt tat gaa att acc gat aat 
Thr Asp Ala Arg Gly Leu His His Leu Val Tyr Glu He Thr Asp Asn 

35 40 45 

get att gat gag gtt ttg get ggc tac get gat gaa att gaa gtc aag 
Ala He Asp Glu Val Leu Ala Gly Tyr Ala Asp Glu He Glu Val Lys 

50 55 60 

ate cac acg gac ggc teg gtt teg gtc aaa gac aat gga egg ggc atg 
He His Thr Asp Gly Ser Val Ser Val Lys Asp Asn Gly Arg Gly Met 
65 70 75 

cca acc ggg atg cat gag tea ggc eta ccc acc ate cag gtt ate ttt 
Pro Thr Gly Met His Glu Ser Gly Leu Pro Thr He Gin Val He Phe 
80 85 90 

acc gtc etc cat gec ggg gga aaa ttt ggc caa gag ggg gee tac aag 
Thr Val Leu His Ala Gly Gly Lys Phe Gly Gin Glu Gly Ala Tyr Lys 
95 100 105 110 

tea gee ggt gga etc cat ggg gtt ggg gee teg gtc gtc aac gee ttg 
Ser Ala Gly Gly Leu His Gly Val Gly Ala Ser Val Val Asn Ala Leu 

115 120 125 

tct gat tgg etc acg gtg ata gtg acc aag gac ggc tat gaa tac egg 
Ser Asp Trp Leu Thr Val He Val Thr Lys Asp Gly Tyr Glu Tyr Arg 

130 135 140 

caa gac ttt age caa gga ggc cag get aaa gga ggc ate cag aag aga 
Gin Asp Phe Ser Gin Gly Gly Gin Ala Lys Gly Gly He Gin Lys Arg 
145 150 155 

aaa att aac cag caa aaa tec age acc ctg gtc cac ttc aaa ccc tea 
Lys He Asn Gin Gin Lys Ser Ser Thr Leu Val His Phe Lys Pro Ser 
160 165 170 

ggc caa gtc ttt teg acc acc gaa ttt aac ttt aac acc ate tgt gag 
Gly Gin Val Phe Ser Thr Thr Glu Phe Asn Phe Asn Thr He Cys Glu 
175 180 185 190 

egg atg egg gag teg gec ttc ctt gtc aaa ggg acc aag att acc gta 
Arg Met Arg Glu Ser Ala Phe Leu Val Lys Gly Thr Lys He Thr Val 

195 200 205 

gag gac ctg cgc cag gaa gaa age cag gtc ttc caa ttt aat gaa gga 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 
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Glu Asp Leu Arg Gin Glu Glu Ser Gin Val Phe Gin Phe Asn Glu Gly 

210 215 220 

att aag gcc ttt gtc gac tac tta aat gag ggc aag gat acc ttg agt 
He Lys Ala Phe Val Asp Tyr Leu Asn Glu Gly Lys Asp Thr Leu Ser 
225 230 235 

cca gta acc tat ttt gaa ggt tct gaa gat gaa att gaa gtt gaa ttt 
Pro Val Thr Tyr Phe Glu Gly Ser Glu Asp Glu He Glu Val Glu Phe 
240 245 250 

gcc ttc caa tac aat gac ggc tat teg gag acg gtt ctg agt ttt gtc 
Ala Phe Gin Tyr Asn Asp Gly Tyr Ser Glu Thr Val Leu Ser Phe Val 
255 260 265 270 

aac aat gtc cgt acc egg gat ggg ggc age cac gaa act gga get aag 
Asn Asn Val Arg Thr Arg Asp Gly Gly Ser His Glu Thr Gly Ala Lys 

275 280 285 

tea get att acc aag get ttc aac gac tat get agg aaa agt ggc tta 
Ser Ala He Thr Lys Ala Phe Asn Asp Tyr Ala Arg Lys Ser Gly Leu 

290 295 300 

etc aaa gag aaa gac agt aac ttg gaa gga tct gac gtc egg gaa ggg 
Leu Lys Glu Lys Asp Ser Asn Leu Glu Gly Ser Asp Val Arg Glu Gly 
305 310 315 

att gcg gtt gtt tta tec gtc cgt ate cca gaa gag att etc caa ttt 
He Ala Val Val Leu Ser .Val Arg He Pro Glu Glu He Leu Gin Phe 
320 325 330 

gaa ggc cag acc aag age aag tta gga act cct caa gcc egg acc gcc 
Glu Gly Gin Thr Lys Ser Lys Leu Gly Thr Pro Gin Ala Arg Thr Ala 
335 340 345 350 

act gac cag gtt ate tea gaa tec tta act tac ttc ctg gcc gaa aat 
Thr Asp Gin Val He Ser Glu Ser Leu Thr Tyr Phe Leu Ala Glu Asn 

355 360 365 



gcc agg gaa gca get cgc aag gcc aag gac cag tec egg aac tct get 
Ala Arg Glu Ala Ala Arg Lys Ala Lys Asp Gin Ser Arg Asn Ser Ala 
385 390 395 

tec aag aaa aaa gtt gaa act etc ctg tct ggt aag ttg acc cca get 
Ser Lys Lys Lys Val Glu Thr Leu Leu Ser Gly Lys Leu Thr Pro Ala 
400 405 410 

caa age aag aac gcc cag aaa aat gaa ctt tac tta gtg gag ggg gat 
Gin Ser Lys Asn Ala Gin Lys Asn Glu Leu Tyr Leu Val Glu Gly Asp 
415 420 425 430 

teg get ggt ggg tea gcc aag caa ggt agg gac egg aaa ttc caa gca 
Ser Ala Gly Gly Ser Ala Lys Gin Gly Arg Asp Arg Lys Phe Gin Ala 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



ggg gac ttg tct aag caa ctt att cgc aag gcc ate cga gcc egg tct 1152 
Gly Asp Leu Ser Lys Gin Leu He Arg Lys Ala He Arg Ala Arg Ser 

370 375 380 



1200 



1248 



1296 



1344 



WO 03/104391 



117/235 



PCTYUS02/36122 



435 440 445 

att ttg ccc ctg cgt gga aag gtt ate aac aca gaa aaa tct tct ttg 
He Leu Pro Leu Arg Gly Lys Val He Asn Thr Glu Lys Ser Ser Leu 

450 455 460 

gat gat att tta aaa aat gaa gaa att tct acc atg att tat acc ate 
Asp Asp He Leu Lys Asn Glu Glu He Ser Thr Met He Tyr Thr He 
465 470 475 

ggt gca ggt get ggg cct gag ttt gat att gaa get gtt aat tac gat 
Gly Ala Gly Ala Gly Pro Glu Phe Asp He Glu Ala Val Asn Tyr Asp 
480 485 490 

aag ata gtc att atg act gat gee gac aca gac ggc gec cac ate cag 
Lys He Val lie Met Thr Asp Ala Asp Thr Asp Gly Ala His He Gin 
495 500 505 510 

gtc ctt etc etc acc ttc ttt tac egg tac atg aaa ccc ctg att gaa 
Val Leu Leu Leu Thr Phe Phe Tyr Arg Tyr Met Lys Pro Leu He Glu 

515 520 525 

gca ggg aag gtc tat att gee eta ccg ccc ttg tat aag ttg acc aaa 
Ala Gly Lys Val Tyr He Ala Leu Pro Pro Leu Tyr Lys Leu Thr Lys 

530 535 540 

aag caa gga aag caa gaa aaa aca gee tat get tgg act gat gag gag 
Lys Gin Gly Lys Gin Glu Lys Thr Ala Tyr Ala Trp Thr Asp Glu Glu 
545 550 555 

ttg gaa gac ctg gtt aaa gat ttt ggc aaa cac tac act etc cag cgc 
Leu Glu Asp Leu Val Lys Asp Phe Gly Lys His Tyr Thr Leu Gin Arg 
560 565 570 



atg gac cca gag acc aga acc ttg ate egg gtc acc att gaa gac agt 
Met Asp Pro Glu Thr Arg Thr Leu He Arg Val Thr He Glu Asp Ser 

595 600 605 



cct aga egg aag tgg att gaa gac cat att gaa ttc agt ctg gca gaa 

Pro Arg Arg Lys Trp He Glu Asp His He Glu Phe Ser Leu Ala Glu 
625 630 635 

gat ggc agt att tta gag aac aag gtc eta gaa gga gag gec aag taa 

Asp Gly Ser He Leu Glu Asn Lys Val Leu Glu Gly Glu Ala Lys 
640 645 650 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



tac aag ggt tta ggc gag atg aat get gac cag ttg tgg gag acc acc 1776 
Tyr Lys Gly Leu Gly Glu Met Asn Ala Asp Gin Leu Trp Glu Thr Thr 
575 580 585 590 



1824 



gaa aag get gaa aga egg gtt tec acc ttg atg ggg acc aag gtg gat 1872 
Glu Lys Ala Glu Arg Arg Val Ser Thr Leu Met Gly Thr Lys Val Asp 

610 615 620 



1920 



1968 



<210> 58 
<211> 653 
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<212> PRT 

<213> Alloiococcus otitidis 
<400> 58 

Met Thr Lys Glu Ser Tyr Asn Asp Ser Ser lie Thr lie Leu Lys Gly 
15 10 15 



Leu Asp Ala Val Lys Lys Arg Pro Gly Met Tyr He Gly Ser Thr Asp 

20 25 30 



Ala Arg Gly Leu His His Leu Val Tyr Glu He Thr Asp Asn Ala He 
35 40 45 



Asp Glu Val Leu Ala Gly Tyr Ala Asp Glu He Glu Val Lys He His 
50 55 60 



Thr Asp Gly Ser Val Ser Val Lys Asp Asn Gly Arg Gly Met Pro Thr 
65 70 75 80 



Gly Met His Glu Ser Gly Leu Pro Thr He Gin Val He Phe Thr Val 

85 90 95 



Leu His Ala Gly Gly Lys Phe Gly Gin Glu Gly Ala Tyr Lys Ser Ala 

100 105 110 



Gly Gly Leu His Gly Val Gly Ala Ser Val Val Asn Ala Leu Ser Asp 
115 120 125 



Trp Leu Thr Val He Val Thr Lys Asp Gly Tyr Glu Tyr Arg Gin Asp 
130 135 140 



Phe Ser Gin Gly Gly Gin Ala Lys Gly Gly He Gin Lys Arg Lys He 
145 150 155 160 



Asn Gin Gin Lys Ser Ser Thr Leu Val His Phe Lys Pro Ser Gly Gin 

165 170 175 



Val Phe Ser Thr Thr Glu Phe Asn Phe Asn Thr He Cys Glu Arg Met 

180 185 190 



Arg Glu Ser Ala Phe Leu Val Lys Gly Thr Lys He Thr Val Glu Asp 
195 200 205 



Leu Arg Gin Glu Glu Ser Gin Val Phe Gin Phe Asn Glu Gly He Lys 
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210 



215 220 



Ala Phe Val Asp Tyr Leu Asn Glu Gly Lys Asp Thr Leu Ser Pro Val 
225 230 235 240 

Thr Tyr Phe Glu Gly Ser Glu Asp Glu lie Glu Val Glu Phe Ala Phe 

245 250 255 

Gin Tyr Asn Asp Gly Tyr Ser Glu Thr Val Leu Ser Phe Val Asn Asn 

260 265 270 

Val Arg Thr Arg Asp Gly Gly Ser His Glu Thr Gly Ala Lys Ser Ala 
275 280 285 

He Thr Lys Ala Phe Asn Asp Tyr Ala Arg Lys Ser Gly Leu Leu Lys 
290 295 300 

Glu Lys Asp Ser Asn Leu Glu Gly Ser Asp Val Arg Glu Gly He Ala 
305 310 315 320 

Val Val Leu Ser Val Arg He Pro Glu Glu He Leu Gin Phe Glu Gly 

325 330 335 

Gin Thr Lys Ser Lys Leu Gly Thr Pro Gin Ala Arg Thr Ala Thr Asp 

340 345 350 

Gin Val He Ser Glu Ser Leu Thr Tyr Phe Leu Ala Glu Asn Gly Asp 
355 360 365 

Leu Ser Lys Gin Leu He Arg Lys Ala He Arg Ala Arg Ser Ala Arg 
370 375 380 

Glu Ala Ala Arg Lys Ala Lys Asp Gin Ser Arg Asn Ser Ala Ser Lys 
385 390 395 400 

Lys Lys Val Glu Thr Leu Leu Ser Gly Lys Leu Thr Pro Ala Gin Ser 

405 410 415 

Lys Asn Ala Gin Lys Asn Glu Leu Tyr Leu Val Glu Gly Asp Ser Ala 

420 425 430 

Gly Gly Ser Ala Lys Gin Gly Arg Asp Arg Lys Phe Gin Ala He Leu 
435 440 445 
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Pro Leu Arg Gly Lys Val lie Asn Thr Glu Lys Ser Ser Leu Asp Asp 
450 455 460 



lie Leu Lys Asn Glu Glu lie Ser Thr Met lie Tyr Thr lie Gly Ala 
465 470 475 480 



Gly Ala Gly Pro Glu Phe Asp lie Glu Ala Val Asn Tyr Asp Lys lie 

485 490 495 



Val He Met Thr Asp Ala Asp Thr Asp Gly Ala His He Gin Val Leu 

500 505 510 



Leu Leu Thr Phe Phe Tyr Arg Tyr Met Lys Pro Leu He Glu Ala Gly 
515 520 525 



Lys Val Tyr He Ala Leu Pro Pro Leu Tyr Lys Leu Thr Lys Lys Gin 
530 535 540 



Gly Lys Gin Glu Lys Thr Ala Tyr Ala Trp Thr Asp Glu Glu Leu Glu 
545 550 555 560 



Asp Leu Val Lys Asp Phe Gly Lys His Tyr Thr Leu Gin Arg Tyr Lys 

565 570 575 



Gly Leu Gly Glu Met Asn Ala Asp Gin Leu Trp Glu Thr Thr Met Asp 

580 ^ 585 590 



Pro Glu Thr Arg Thr Leu He Arg Val Thr He Glu Asp Ser Glu Lys 
595 600 605 



Ala Glu Arg Arg Val Ser Thr Leu Met Gly Thr Lys Val Asp Pro Arg 
610 615 620 



Arg Lys Trp He Glu Asp His He Glu Phe Ser Leu Ala Glu Asp Gly 
625 630 635 640 



Ser He Leu Glu Asn Lys Val Leu Glu Gly Glu Ala Lys 

645 650 



<210> 59 
<211> 2463 
<212> DNA 
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<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (4) . - (2463) 

<223> 

<400> 59 

att atg gca gga gac caa gag acc agt aaa ata caa gaa tta acc tta 

Met Ala Gly Asp Gin Glu Thr Ser Lys He Gin Glu Leu Thr Leu 

15 10 15 

gaa gat gtc atg ggg gac egg ttc ggc egg tat tec aag tac att ata 
Glu Asp Val Met Gly Asp Arg Phe Gly Arg Tyr Ser Lys Tyr He He 

20 25 30 

cag gaa agg gec eta ccg gac ttg egg gac ggt tta aaa ccg gtc caa 
Gin Glu Arg Ala Leu Pro Asp Leu Arg Asp Gly Leu Lys Pro Val Gin 

35 40 45 

aga egg ate etc tat gee atg cac cag gac aaa aac acc tat gac aag 
Arg Arg He Leu Tyr Ala Met His Gin Asp Lys Asn Thr Tyr Asp Lys 
50 55 60 

get tac egg aag teg gec aag acg gtg gga aat gtc ata ggg aac tac 
Ala Tyr Arg Lys Ser Ala Lys Thr Val Gly Asn Val He Gly Asn Tyr 
65 70 75 

cac ccc cat ggc gac aca tec gtt tac gat gee atg gtt agg etc agt 
His Pro His Gly Asp Thr Ser Val Tyr Asp Ala Met Val Arg Leu Ser 
80 85 90 95 



ggg age atg gac ggg gac cca cca get gec atg egg tac acc gaa gec 
Gly Ser Met Asp Gly Asp Pro Pro Ala Ala Met Arg Tyr Thr Glu Ala 

115 120 125 

cgt ctg tct aaa att get tec gac etc ctg get gat att gat aag gag 
Arg Leu Ser Lys He Ala Ser Asp Leu Leu Ala Asp He Asp Lys Glu 
130 135 140 

acg gtg gac cat gtc tta aac ttt gat gac acg acc gag gag ccc acc 
Thr Val Asp His Val Leu Asn Phe Asp Asp Thr Thr Glu Glu Pro Thr 
145 150 155 



att tea gee ggt tat get act gac ata ccg ccc cat aat ttg age gag 
He Ser Ala Gly Tyr Ala Thr Asp He Pro Pro His Asn Leu Ser Glu 

180 185 190 



48 



96 



144 



192 



240 



288 



cag cct tgg aag atg cgc cat cct ttg gtt gat atg cac ggg aac aag 336 
Gin Pro Trp Lys Met Arg His Pro Leu Val Asp Met His Gly Asn Lys 

100 105 110 



384 



432 



480 



gtc tta ccc gee cgt ttt ccc aac etc ttg gtc aat ggg get age ggg 528 
Val Leu Pro Ala Arg Phe Pro Asn Leu Leu Val Asn Gly Ala Ser Gly 
160 165 170 175 



576 



gtg att gat gee acc ate cac tta ate aac cac ccc aat gca agg ctg 



624 
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att gtg gtc gaa acc aaa aaa gat ggt gat ggg gaa ggg ate tta acc 
He Val Val Glu Thr Lys Lys Asp Gly Asp Gly Glu Gly He Leu Thr 
305 310 315 

tac ctg ctg aaa aac acc gac etc cag gta act tat aac tta aat atg 
Tyr Leu Leu Lys Asn Thr Asp Leu Gin Val Thr Tyr Asn Leu Asn Met 
320 325 330 335 

gta gec att gat aaa aaa cga ccc cag caa gtc tec etc aag caa ate 
Val Ala He Asp Lys Lys Arg Pro Gin Gin Val Ser Leu Lys Gin He 

340 345 350 

tta tct tct tac ttg gac cac aag egg aca gtg gtt caa aac egg acc 
Leu Ser Ser Tyr Leu Asp His Lys Arg Thr Val Val Gin Asn Arg Thr 

355 360 365 

cgt tac etc tta gec aag gec aag gac cgc cag cac att gtc caa ggc 
Arg Tyr Leu Leu Ala Lys Ala Lys Asp Arg Gin His He Val Gin Gly 
370 . 375 380 

ctt ate aag gec att tea ate ctg gat gac ttg ate caa acc ate egg 
Leu He Lys Ala He Ser He Leu Asp Asp Leu He Gin Thr lie Arg 
385 390 ■ 395 

gee agt gaa aac aag gee aat gee aag gaa aat att ate cag get tat 
Ala Ser Glu Asn Lys Ala Asn Ala Lys Glu Asn He He Gin Ala Tyr 
400 405 410 415 

ggt ttt age caa gac caa gee gaa gec att gtc tec etc cag ctt tac 
Gly Phe Ser Gin Asp Gin Ala Glu Ala He Val Ser Leu Gin Leu Tyr 



672 



720 



768 



Val He Asp Ala Thr He His Leu He Asn His Pro Asn Ala Arg Leu 

195 200 205 

gag act ttg atg gac tat att caa gga cca gac ttt ccg act ggg ggg 
Glu Thr Leu Met Asp Tyr He Gin Gly Pro Asp Phe Pro Thr Gly Gly 
210 215 220 

att ate caa ggt aaa agt ggc ctg aag aaa gec tac caa acg ggc aag 
He He Gin Gly Lys Ser Gly Leu Lys Lys Ala Tyr Gin Thr Gly Lys 
225 230 235 

gga aaa att ate ate egg gee aaa gca gat att gag gec ate egg ggt 
Gly Lys He He He Arg Ala Lys Ala Asp He Glu Ala He Arg Gly 
240 245 250 255 

ggc aaa tec caa att gtc ate agt caa att cct tat gag gtc aac aag 
Gly Lys Ser Gin He Val He Ser Gin He Pro Tyr Glu Val Asn Lys 

260 265 270 

gca agg ttg gtc caa aaa att gac gac ate egg att aac aaa aaa ate 
Ala Arg Leu Val Gin Lys He Asp Asp He Arg He Asn Lys Lys He 

275 280 285 

gac ggc att gee gat gtc egg gat gaa agt gac egg tct ggc ttg egg 912 
Asp Gly He Ala Asp Val Arg Asp Glu Ser Asp Arg Ser Gly Leu Arg 
290 295 300 



816 



864 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 
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420 425 430 

cgc ttg acc aat aca gat ata aag gac tta caa gca gaa gcc aaa gac 
Arg Leu Thr Asn Thr Asp He Lys Asp Leu Gin Ala Glu Ala Lys Asp 

435 440 445 

tta gcc caa gcc ate ctg acc tac cag gac etc tta acc aac aag gcc 
Leu Ala Gin Ala He Leu Thr Tyr Gin Asp Leu Leu Thr Asn Lys Ala 
450 455 460 

age ctg gat get ttg atg aaa gaa gaa ttg aaa gaa gtc aaa caa gca 
Ser Leu Asp Ala Leu Met Lys Glu Glu Leu Lys Glu Val Lys Gin Ala 
465 470 475 

tat ggg gag gac egg eta acc cag gtc caa gac aag ate gaa aaa eta 
Tyr Gly Glu Asp Arg Leu Thr Gin Val Gin Asp Lys He Glu Lys Leu 
480 485 490 495 



gtc acc cag gga ggt tac ttg aag egg acc tec ate egg tct tac aag 
Val Thr Gin Gly Gly Tyr Leu Lys Arg Thr Ser He Arg Ser Tyr Lys 

515 520 525 



ttt atg caa gag ttg tea acc eta gac caa etc ctt att ttc acc teg 
Phe Met Gin Glu Leu Ser Thr Leu Asp Gin Leu Leu He Phe Thr Ser 
545 550 555 

aaa ggc aat gtg gtc aac cga cca gtc cat gaa tta ccg gac ate aag 
Lys Gly Asn Val Val Asn Arg Pro Val His Glu Leu Pro Asp He Lys 
560 565 570 575 

tgg aag gat att gga gag cac ttg tea agg acc ate ccc ctt gga gag 
Trp Lys Asp He Gly Glu His Leu Ser Arg Thr He Pro Leu Gly Glu 

580 585 590 

gac gag gaa ttg att aag gtg tac cct tat egg gaa tta gat gcc ggc 
Asp Glu Glu Leu He Lys Val Tyr * Pro Tyr Arg Glu Leu Asp Ala Gly 

595 600 605 

aag cgc tat gtc ttt ate act cga gat ggc tat ate aaa caa agt cca 
Lys Arg Tyr Val Phe He Thr Arg Asp Gly Tyr He Lys Gin Ser Pro 
610 615 620 

gag acg gaa ttt gag ccc aaa cga act tac aag tct egg get tea act 
Glu Thr Glu Phe Glu Pro Lys Arg Thr Tyr Lys Ser Arg Ala Ser Thr 
625 630 635 

gcc att aaa tta aaa tea gac caa gat aga etc cag gca gtc tac tat 
Ala He Lys Leu Lys Ser Asp Gin Asp Arg Leu Gin Ala Val Tyr Tyr 
640 645 650 655 



1344 



1392 



1440 



1488 



gaa ata gaa acc caa gtc ctg gtc agt gaa gaa gac gtc atg gtt acc 153 6 

Glu He Glu Thr Gin Val Leu Val Ser Glu Glu Asp Val Met Val Thr 

500 505 510 



1584 



get tec caa gtg gag gaa ttg ggc egg cga gaa gac gac ttg gtc ate 1632 
Ala Ser Gin Val Glu Glu Leu Gly Arg Arg Glu Asp Asp Leu Val He 
530 535 540 



1680 



1728 



1776 



1824 



1872 



1920 



1968 
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att cct gac caa gaa gat tac gat gta ttc eta gec age tac aag ggc 
He Pro Asp Gin Glu Asp Tyr Asp Val Phe Leu Ala Ser Tyr Lys Gly 

660 665 670 

tac ggg etc aag tat gga eta gaa gaa gtg tea gaa gta ggg gee cag 
Tyr Gly Leu Lys Tyr Gly Leu Glu Glu Val Ser Glu Val Gly Ala Gin 

675 680 685 

get gca ggc gtc aag tec atg aac ctg aaa gag ggg gac cat gtc caa 
Ala Ala Gly Val Lys Ser Met Asn Leu Lys Glu Gly Asp His Val Gin 
690 695 700 

gat ggt ttg gtc ttt aag cgt aag cag ttc caa gaa gec ttg ttc att 
Asp Gly Leu Val Phe Lys Arg Lys Gin Phe Gin Glu Ala Leu Phe He 
705 710 715 

acc cag cga gee agt gtt aag aaa atg gee etc cat gac ttt gac egg 
Thr Gin Arg Ala Ser Val Lys Lys Met Ala Leu His Asp Phe Asp Arg 
720 725 730 735 

act tea egg gec aag egg ggt tta caa ate etc aga gaa ctg aag cga 
Thr Ser Arg Ala Lys Arg Gly Leu Gin He Leu Arg Glu Leu Lys Arg 

740 745 750 

aac ccc cac cga ate cag ttt atg ate gga att tea caa aat aaa ttc 
Asn Pro His Arg lie Gin Phe Met He Gly He Ser Gin Asn Lys Phe 

755 760 765 

ctg gtc aat etc eta act gat aca aaa aaa eta gta cag ata aac cca 
Leu Val Asn Leu Leu Thr Asp Thr Lys Lys Leu Val Gin He Asn Pro 
770 775 780 

gat gac tat aca gtt tea aac cgc cat aac aat ggg tct ttt gtc ctg 
Asp Asp Tyr Thr Val Ser Asn Arg His Asn Asn Gly Ser Phe Val Leu 
785 790 795 

gac aca age cga gat ggc aag cct gtt tct tac tat tta agt gat aac 
Asp Thr Ser Arg Asp Gly Lys Pro Val Ser Tyr Tyr Leu Ser Asp Asn 
800 805 810 815 

gat tct cac ttg taa 
Asp Ser His Leu 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



2352 



2400 



2448 



2463 



<210> 60 
<211> 819 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 60 

Met Ala Gly Asp Gin Glu Thr Ser Lys He Gin Glu Leu Thr Leu Glu 
! 5 10 15 



Asp Val Met Gly Asp Arg Phe Gly Arg Tyr Ser Lys Tyr He He 

20 25 30 
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Glu Arg Ala Leu Pro Asp Leu Arg Asp Gly Leu Lys Pro Val Gin Arg 
35 40 45 



Arg lie Leu Tyr Ala Met His Gin Asp Lys Asn Thr Tyr Asp Lys Ala 
50 55 60 



Tyr Arg Lys Ser Ala Lys Thr Val Gly Asn Val He Gly Asn Tyr His 
65 70 75 80 



Pro His Gly Asp Thr Ser Val Tyr Asp Ala Met Val Arg Leu Ser Gin 

85 90 95 



Pro Trp Lys Met Arg His Pro Leu Val Asp Met His Gly Asn Lys Gly 

100 105 110 



Ser Met Asp Gly Asp Pro Pro Ala Ala Met Arg Tyr Thr Glu Ala Arg 
115 120 125 



Leu Ser Lys He Ala Ser Asp Leu Leu Ala Asp He Asp Lys Glu Thr 
130 135 140 



Val Asp His Val Leu Asn Phe Asp Asp Thr Thr Glu Glu Pro Thr Val 
145 150 155 160 



Leu Pro Ala Arg Phe Pro Asn Leu Leu Val Asn Gly Ala Ser Gly He 

165 170 175 



Ser Ala Gly Tyr Ala Thr Asp He Pro Pro His Asn Leu Ser Glu Val 

180 185 190 



He Asp Ala Thr He His Leu He Asn His Pro Asn Ala Arg Leu Glu 
195 200 205 



Thr Leu Met Asp Tyr He Gin Gly Pro Asp Phe Pro Thr Gly Gly He 
210 • 215 220 



He Gin Gly Lys Ser Gly Leu Lys Lys Ala Tyr Gin Thr Gly Lys Gly 
225 230 235 240 



Lys He He He Arg Ala Lys Ala Asp He Glu Ala He Arg Gly Gly 

245 250 255 
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Lys Ser Gin lie Val lie Ser Gin He Pro Tyr Glu Val Asn Lys Ala 

260 265 270 



Arg Leu Val Gin Lys He Asp Asp He Arg He Asn Lys Lys He Asp 
275 280 285 



Gly He Ala Asp Val Arg Asp Glu Ser Asp Arg Ser Gly Leu Arg He 
290 295 300 



Val Val Glu Thr Lys Lys Asp Gly Asp Gly Glu Gly He Leu Thr Tyr 
305 310 315 320 



Leu Leu Lys Asn Thr Asp Leu Gin Val Thr Tyr Asn Leu Asn Met Val 

325 330 335 



Ala He Asp Lys Lys Arg Pro Gin Gin Val Ser Leu Lys Gin He Leu 

340 345 350 



Ser Ser Tyr Leu Asp His Lys Arg Thr Val Val Gin Asn Arg Thr Arg 
355 360 365 



Tyr Leu Leu Ala Lys Ala Lys Asp Arg Gin His He Val Gin Gly Leu 
370 375 380 



He Lys Ala He Ser He Leu Asp Asp Leu He Gin Thr He Arg Ala 
385 390 395 400 



Ser Glu Asn Lys Ala Asn Ala Lys Glu Asn He He Gin Ala Tyr Gly 

405 410 415 



Phe Ser Gin Asp Gin Ala Glu Ala He Val Ser Leu Gin Leu Tyr Arg 

420 425 430 



Leu Thr Asn Thr Asp He Lys Asp Leu Gin Ala Glu Ala Lys Asp Leu 
435 440 445 



Ala Gin Ala He Leu Thr Tyr Gin Asp Leu Leu Thr Asn Lys Ala Ser 
450 455 460 



Leu Asp Ala Leu Met Lys Glu Glu Leu Lys Glu Val Lys Gin Ala Tyr 
465 470 475 480 
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Gly Glu Asp Arg Leu Thr Gin Val Gin Asp Lys lie Glu Lys Leu Glu 

485 490 495 

lie Glu Thr Gin Val Leu Val Ser Glu Glu Asp Val Met Val Thr Val 

500 505 510 

Thr Gin Gly Gly Tyr Leu Lys Arg Thr Ser lie Arg Ser Tyr Lys Ala 
515 520 525 

Ser Gin Val Glu Glu Leu Gly Arg Arg Glu Asp Asp Leu Val lie Phe 
530 535 540 

Met Gin Glu Leu Ser Thr Leu Asp Gin Leu Leu lie Phe Thr Ser Lys 
545 550 555 560 

Gly Asn Val Val Asn Arg Pro Val His Glu Leu Pro Asp lie Lys Trp 

565 570 575 

Lys Asp lie Gly Glu His Leu Ser Arg Thr He Pro Leu Gly Glu Asp 

580 585 590 

Glu Glu Leu He Lys Val Tyr Pro Tyr Arg Glu Leu Asp Ala Gly Lys 
595 600 605 

Arg Tyr Val Phe He Thr Arg Asp Gly Tyr He Lys Gin Ser Pro Glu 
610 615 620 

Thr Glu Phe Glu Pro Lys Arg Thr Tyr Lys Ser Arg Ala Ser Thr Ala 
625 630 635 640 

He Lys Leu Lys Ser Asp Gin Asp Arg Leu Gin Ala Val Tyr Tyr He 

645 650 655 

Pro Asp Gin Glu Asp Tyr Asp Val Phe Leu Ala Ser Tyr Lys Gly Tyr 

660 665 670 

Gly Leu Lys Tyr Gly Leu Glu Glu Val Ser Glu Val Gly Ala Gin Ala 
675 680 685 

Ala Gly Val Lys Ser Met Asn Leu Lys Glu Gly Asp His Val Gin Asp 
690 695 700 

Gly Leu Val Phe Lys Arg Lys Gin Phe Gin Glu Ala Leu Phe He Thr 
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705 710 715 720 



Gin Arg Ala Ser Val Lys Lys Met Ala Leu His Asp Phe Asp Arg Thr 

725 730 735 



Ser Arg Ala Lys Arg Gly Leu Gin He Leu Arg Glu Leu Lys Arg Asn 

740 745 750 



Pro His Arg He Gin Phe Met He Gly He Ser Gin Asn Lys Phe Leu 
755 760 765 



Val Asn Leu Leu Thr Asp Thr Lys Lys Leu Val Gin He Asn Pro Asp 
770 775 780 



Asp Tyr Thr Val Ser Asn Arg His Asn Asn Gly Ser Phe Val Leu Asp 
785 790 795 800 



Thr Ser Arg Asp Gly Lys Pro Val Ser Tyr Tyr Leu Ser Asp Asn Asp 

805 810 815 



Ser His Leu 



<210> 61 
<211> 1113 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (4) . . (1113) 

<223> 

<400> 61 

tta gtg gtt gag aca aaa tea aaa eta gaa aat gca gta aac acc etc 48 
Met Val Glu Thr Lys Ser Lys Leu Glu Asn Ala Val Asn Thr Leu 
15 10 15 

att aaa gac ttg aaa aat aaa aaa gag teg acc att tct tat att gac 96 
He Lys Asp Leu Lys Asn Lys Lys Glu Ser Thr He Ser Tyr He Asp 

20 25 30 

etc age aac aaa att get gaa ccc ttc gaa ctt gaa agt gaa gec atg 144 
Leu Ser Asn Lys He Ala Glu Pro Phe Glu Leu Glu Ser Glu Ala Met 

35 40 45 

gac aag tta ate cag caa tta gaa gat gat ggg att ggt gta gtt gac 192 
Asp Lys Leu He Gin Gin Leu Glu Asp Asp Gly He Gly Val Val Asp 
50 55 60 
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caa gac ggt aat ccc ttg gcc aag caa eta gec aag cag gaa gaa gaa 
Gin Asp Gly Asn Pro Leu Ala Lys Gin Leu Ala Lys Gin Glu Glu Glu 
65 70 75 

gca gaa aaa gcc aag gat gaa gaa atg ata gcc cca cct ggg gtt aaa 
Ala Glu Lys Ala Lys Asp Glu Glu Met lie Ala Pro Pro Gly Val Lys 
80 85 90 95 



240 



ctt tta gat get gaa gaa gaa gtg gcc eta gcc aag egg att gaa gaa 
Leu Leu Asp Ala Glu Glu Glu Val Ala Leu Ala Lys Arg lie Glu Glu 

115 120 125 

ggc gat gaa ate get aaa caa gaa eta get gag get aac ttg aga ctg 
Gly Asp Glu lie Ala Lys Gin Glu Leu Ala Glu Ala Asn Leu Arg Leu 
130 135 140 

gtt gtc tct att get aaa egg tac gtt ggc egg ggc atg age ttt ttg 
Val Val Ser lie Ala Lys Arg Tyr Val Gly Arg Gly Met Ser Phe Leu 
145 150 155 

gac ttg ate cag gaa ggg aat atg ggg eta atg aag gca gtt gaa aaa 
Asp Leu lie Gin Glu Gly Asn Met Gly Leu Met Lys Ala Val Glu Lys 
160 165 170 175 

ttt gac tac gaa aaa ggt ttc aaa ttt tea acc tat gcc acc tgg tgg 
Phe Asp Tyr Glu Lys Gly Phe Lys Phe Ser Thr Tyr Ala Thr Trp Trp 

180 185 190 

ate cgt caa gcc ate act egg gcc att gcc gac caa gcc cga acc ate 
lie Arg Gin Ala lie Thr Arg Ala He Ala Asp Gin Ala Arg Thr He 

195 200 205 

egg att ccg gtc cac atg gtc gaa act att aac aag ctg gtc cga ate 
Arg He Pro Val His Met Val Glu Thr He Asn Lys Leu Val Arg He 
210 215 220 

cag egg cag etc eta caa gaa eta ggc egg gaa cca acc cca gaa gaa 
Gin Arg Gin Leu Leu Gin Glu Leu Gly Arg Glu Pro Thr Pro Glu Glu 
225 230 235 

att ggg gca gag atg gat ttg cca acc gaa aaa gtc aga gat att ttg 
He Gly Ala Glu Met Asp Leu Pro Thr Glu Lys Val Arg Asp He Leu 
240 245 250 255 

aaa att tec caa gaa ccc gtc tec ctt gaa acc cca att ggg gaa gaa 
Lys He Ser Gin Glu Pro Val Ser Leu Glu Thr Pro He Gly Glu Glu 

260 265 270 

gaa gat tec cac ctg gga gac ttt att gaa gat gat ggg gcc ttg teg 
Glu Asp Ser His Leu Gly Asp Phe He Glu Asp Asp Gly Ala Leu Ser 

275 280 285 



288 



att aac gac cct gtc egg atg tac eta aaa gaa att ggc egg gta gat 33 6 

He Asn Asp Pro Val Arg Met Tyr Leu Lys Glu He Gly Arg Val Asp 

100 105 110 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



cca tct gat aat gca get tat gag ctg ttg aaa ggg gaa etc aaa gga 



912 
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Pro Ser Asp Asn Ala Ala Tyr Glu Leu Leu Lys Gly Glu Leu Lys Gly 
290 295 300 

gtc tta gac acc eta act gac egg gaa gaa aat gtc ttg cgc etc cgt 
Val Leu Asp Thr Leu Thr Asp Arg Glu Glu Asn Val Leu Arg Leu Arg 
305 310 315 

ttt ggc eta gat gat ggc cgt caa cgt act tta gaa gat gtc ggt aag 
Phe Gly Leu Asp Asp Gly Arg Gin Arg Thr Leu Glu Asp Val Gly Lys 
320 325 330 335 

gtc ttt ggg gtc acc egg gag egg ate cgt caa att gaa gcg aag gec 
Val Phe Gly Val Thr Arg Glu Arg lie Arg Gin lie Glu Ala Lys Ala 

340 345 ' 350 

etc cgc aaa etc cgc cac cct age egg tec aaa caa tta aaa gac ttt 
Leu Arg Lys Leu Arg His Pro Ser Arg Ser Lys Gin Leu Lys Asp Phe 

355 360 365 

tta gaa tag 
Leu Glu 



<210> 62 
<211> 369 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 62 

Met Val Glu Thr Lys Ser Lys Leu Glu Asn Ala Val Asn Thr Leu He 
15 10 15 



Lys Asp Leu Lys Asn Lys Lys Glu Ser Thr He Ser Tyr lie Asp Leu 

20 25 30 



Ser Asn Lys He Ala Glu Pro Phe Glu Leu Glu Ser Glu Ala Met Asp 
35 40 45 



Lys Leu He Gin Gin Leu Glu Asp Asp Gly He Gly Val Val Asp Gin 
50 55 60 



Asp Gly Asn Pro Leu Ala Lys Gin Leu Ala Lys Gin Glu Glu Glu Ala 
65 70 75 80 



Glu Lys Ala Lys Asp Glu Glu Met He Ala Pro Pro Gly Val Lys He 

85 90 95 



960 



1008 



1056 



1104 



1113 



Asn Asp Pro Val Arg Met Tyr Leu Lys Glu He Gly Arg Val Asp Leu 

100 105 HO 
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Leu Asp Ala Glu Glu Glu Val Ala Leu Ala Lys Arg lie Glu Glu Gly 
115 120 125 

Asp Glu lie Ala Lys Gin Glu Leu Ala Glu Ala Asn Leu Arg Leu Val 
130 135 140 



Val Ser He Ala Lys Arg Tyr Val Gly Arg Gly Met Ser Phe Leu Asp 
145 150 155 160 

Leu lie Gin Glu Gly Asn Met Gly Leu Met Lys Ala Val Glu Lys Phe 

165 170 175 



Asp Tyr Glu Lys Gly Phe Lys Phe Ser Thr Tyr Ala Thr Trp Trp lie 

180 185 190 



Arg Gin Ala lie Thr Arg Ala lie Ala Asp Gin Ala Arg Thr lie Arg 
195 200 205 



He Pro Val His Met Val Glu Thr He Asn Lys Leu Val Arg He Gin 
210 215 220 



Arg Gin Leu Leu Gin Glu Leu Gly Arg Glu Pro Thr Pro Glu Glu lie 

225 230 235 240 

Gly Ala Glu Met Asp Leu Pro Thr Glu Lys Val Arg Asp He Leu Lys 

245 250 255 



He Ser Gin Glu Pro Val Ser Leu Glu Thr Pro He Gly Glu Glu Glu 

260 265 270 



Asp Ser His Leu Gly Asp Phe He Glu Asp Asp Gly Ala Leu Ser Pro 
275 280 285 



Ser Asp Asn Ala Ala Tyr Glu Leu Leu Lys Gly Glu Leu Lys Gly Val 
290 295 300 



Leu Asp Thr Leu Thr Asp Arg Glu Glu Asn Val Leu Arg Leu Arg Phe 
305 310 315 320 



Gly Leu Asp Asp Gly Arg Gin Arg Thr Leu Glu Asp Val Gly Lys Val 

325 330 335 



Phe Gly Val Thr Arg Glu Arg He Arg Gin He Glu Ala Lys Ala Leu 
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340 345 350 



Arg Lys Leu Arg His Pro Ser Arg Ser Lys Gin Leu Lys Asp Phe Leu 
355 360 365 



Glu 



<210> 63 
<211> 1854 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (1) . . (1854) 

<223> 

<400> 63 

atg gtt aga ata cct gaa gag acc att aat caa ata cga age cag gca 

Met Val Arg lie Pro Glu Glu Thr He Asn Gin He Arg Ser Gin Ala 
1" 5 10 15 

gat att gtc gat gtc att ggc caa tac ttg gac tta aac aag tct ggg 
Asp He Val Asp Val He Gly Gin Tyr Leu Asp Leu Asn Lys Ser Gly 

20 25 30 

gec aat tac ttt gec cac tgc ccc ttc cat gaa gac age acg cct tct 
Ala Asn Tyr Phe Ala His Cys Pro Phe His Glu Asp Ser Thr Pro Ser 
35 40 45 

ttt teg gtc aac aga gac aag caa att tat aag tgc ttt tct tgc aaa 
Phe Ser Val Asn Arg Asp Lys Gin He Tyr Lys Cys Phe Ser Cys Lys 
50 55 60 

cga ggt ggc agt gtc ttt age ttt ata caa gag aag gag gga ctt tec 
Arg Gly Gly Ser Val Phe Ser Phe He Gin Glu Lys Glu Gly Leu Ser 
65 70 75 80 

ttc cca gaa teg gtt ctt aaa gtg gca gac tta get aat gtg gac ctt 
Phe Pro Glu Ser Val Leu Lys Val Ala Asp Leu Ala Asn Val Asp- Leu 

85 90 95 

gat ccg gec tta aaa gaa get gtc caa ggc caa cct gac aaa gec gat 
Asp Pro Ala Leu Lys Glu Ala Val Gin Gly Gin Pro Asp Lys Ala Asp 

100 105 HO 

tct ccc tac cga gac etc tat acc ate cat gac cag gee aag gac tac 
Ser Pro Tyr Arg Asp Leu Tyr Thr He His Asp Gin Ala Lys Asp Tyr 
115 120 125 

tac cag tat ate etc tta aag gec cag gtg gga gaa gtt get tac gac 
Tyr Gin Tyr He Leu Leu Lys Ala Gin Val Gly Glu Val Ala Tyr Asp 
130 135 140 



48 



96 



144 



192 



240 



288 



336 



384 



432 
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tat etc cag aat cgt ggg att tec aga gag gtg atg gaa gag ttc gaa 480 
Tyr Leu Gin Asn Arg Gly lie Ser Arg Glu Val Met Glu Glu Phe Glu 
145 150 155 160 

ctg ggt tat tct ccc age caa agg gag teg etc cac ctt tat ttg cag 528 
Leu Gly Tyr Ser Pro Ser Gin Arg Glu Ser Leu His Leu Tyr Leu Gin 

165 170 175 

tec caa gac cag gcg gac ttg aca gat gac tta ctg gaa gaa acc ggc 576 
Ser Gin Asp Gin Ala Asp Leu Thr Asp Asp Leu Leu Glu Glu Thr Gly 

180 185 190 

ctt ttt tec aaa aga gaa gtg gaa agt gat agt ttt aaa gac cgc ttt 624 
Leu Phe Ser Lys Arg Glu Val Glu Ser Asp Ser Phe Lys Asp Arg Phe 
195 200 205 

gee aag egg ate ate ttc ccc tta aag aac tta caa ggg cag acg gtg 672 
Ala Lys Arg lie He Phe Pro Leu Lys Asn Leu Gin Gly Gin Thr Val 
210 215 220 

ggc ttt teg ggc egg tat ttc caa gat gag cct aac cag gac ttc cat 720 
Gly Phe Ser Gly Arg Tyr Phe Gin Asp Glu Pro Asn Gin Asp Phe His 
225 230 235 240 

cat gee aag tat tta aac agt cca gaa acc aaa ata ttc aat aaa egg 7 68 

His Ala Lys Tyr Leu Asn Ser Pro Glu Thr Lys He Phe Asn Lys Arg 

245 250 255 

egg acc etc ttt aac tac cac cag gee aag gee tac att cgt egg gee 816 
Arg Thr Leu Phe Asn Tyr His Gin Ala Lys Ala Tyr He Arg Arg Ala 

260 265 270 

aag gaa gtt gtc tta ttc gaa ggt tac atg gat gtg att get get tgg 864 
Lys Glu Val Val Leu Phe Glu Gly Tyr Met Asp Val He Ala Ala Trp 
275 280 285 

caa gcg ggg gtc aaa aat ggc tta get tec atg ggg acc agt ata aca 912 
Gin Ala Gly Val Lys Asn Gly Leu Ala Ser Met Gly Thr Ser He Thr 
290 295 300 

get gac caa gtc cag acc atg caa agg att get gac acc tta gtc ttg 960 
Ala Asp Gin Val Gin Thr Met Gin Arg lie Ala Asp Thr Leu Val Leu 
305 310 315 320 

gee ttt gac ggg gat gaa get ggc ctt gaa tec age aaa aag ate ctg 1008 
Ala Phe Asp Gly Asp Glu Ala Gly Leu Glu Ser Ser Lys Lys He Leu 

325 330 335 

gat gac tta age ttg acc age aag ctt caa att gaa gtg gtc att ttc 1056 
Asp Asp Leu Ser Leu Thr Ser Lys Leu Gin He Glu Val Val He Phe 

340 345 350 

cct aaa aaa atg gac ccg gat gaa tat att aga gaa aat gga cca gaa 1104 
Pro Lys Lys Met Asp Pro Asp Glu Tyr He Arg Glu Asn Gly Pro Glu 
355 360 365 



gec ttt caa aat etc ate caa cat ggt agg atg act gtc tac caa ttc 1152 
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Ala Phe Gin Asn Leu lie Gin His Gly Arg Met Thr Val Tyr Gin Phe 
370 375 380 

tta aaa gaa tac ttt aaa aaa tec tac aat eta gat aac gac teg gac 
Leu Lys Glu Tyr Phe Lys Lys Ser Tyr Asn Leu Asp Asn Asp Ser Asp 
385 390 395 400 

egg ttg aaa ttt ate caa ace atg ace aat aaa att ggc aag eta get 
Arg Leu Lys Phe lie Gin Thr Met Thr Asn Lys He Gly Lys Leu Ala 

405 410 415 

tec ccc ttg gaa agg gaa gtc tat gee aag gat ttg gca gaa gaa ttt 
Ser Pro Leu Glu Arg Glu Val Tyr Ala Lys Asp Leu Ala Glu Glu Phe 

420 425 430 

aac ctg tct tat gat acg att ata age caa gtt caa agt gaa gee act 
Asn Leu Ser Tyr Asp Thr He He Ser Gin Val Gin Ser Glu Ala Thr 
435 440 445 



caa gca aga gtg gaa gtc aaa gec cca agt agt caa aag act aag att 
Gin Ala Arg Val Glu Val Lys Ala Pro Ser Ser Gin Lys Thr Lys He 
465 470 475 480 

gac egg gee cag gaa aaa ctt 'tta aac cga etc ttt tac tat ccc caa 
Asp Arg Ala Gin Glu Lys Leu Leu Asn Arg Leu Phe Tyr Tyr Pro Gin 

485 490 495 

gtt caa gag ate ate gat get tat aat ccg gac ttt gaa ttt aaa acg 
Val Gin Glu He He Asp Ala Tyr Asn Pro Asp Phe Glu Phe Lys Thr 

500 505 510 

gaa gtc cac cag egg att tac etc ttg ttt tta gaa tac age cag gaa 
Glu Val His Gin Arg He Tyr Leu Leu Phe Leu Glu Tyr Ser Gin Glu 
515 520 525 



aaa gag gtc ata tct gat ata atg tgg aca tec att gag gtc gaa ccc 

Lys Glu Val He Ser Asp He Met Trp Thr Ser He Glu Val Glu Pro 

545 550 555 560 

tea gat gaa gaa ate eta gac tac ttg gac tac att gac caa ace tac 

Ser Asp Glu Glu He Leu Asp Tyr Leu Asp Tyr lie Asp Gin Thr Tyr 

565 570 575 



1200 



1248 



1296 



1344 



eta aac cag caa gag get ttg aaa aag gac egg cat aag gaa ttt tct 1392 
Leu Asn Gin Gin Glu Ala Leu Lys Lys Asp Arg His Lys Glu Phe Ser 
450 455 460 



1440 



1488 



1536 



1584 



aat gat age att gat tct ttc ate gat ttt gtc aaa gac aag gag acg 1632 
Asn Asp Ser He Asp Ser Phe He Asp Phe Val Lys Asp Lys Glu Thr 
530 535 540 



1680 



1728 



ccc ctg gag caa aaa cgc caa gac tgc ttg gag gaa gtc aaa gca get 1776 
Pro Leu Glu Gin Lys Arg Gin Asp Cys Leu Glu Glu Val Lys Ala Ala 

580 585 590 



aaa cag tec ggt aat aag aag cga gag ctg gaa tta ace aat caa tta 
Lys Gin Ser Gly Asn Lys Lys Arg Glu Leu Glu Leu Thr Asn Gin Leu 



1824 
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595 600 605 

att gaa ata aac cgt atg eta aaa caa taa 1854 
lie Glu lie Asn Arg Met Leu Lys Gin 
610 615 



<210> 64 
<211> 617 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 64 

Met Val Arg lie Pro Glu Glu Thr lie Asn Gin lie Arg Ser Gin Ala 
15 10 15 



Asp lie Val Asp Val lie Gly Gin Tyr Leu Asp Leu Asn Lys Ser Gly 

20 25 30 



Ala Asn Tyr Phe Ala His Cys Pro Phe His Glu Asp Ser Thr Pro Ser 
35 40 45 



Phe Ser Val Asn Arg Asp Lys Gin lie Tyr Lys Cys Phe Ser Cys Lys 
50 55 60 



Arg Gly Gly Ser Val Phe Ser Phe lie Gin Glu Lys Glu Gly Leu Ser 
65 70 75 80 



Phe Pro Glu Ser Val Leu Lys Val Ala Asp Leu Ala Asn Val Asp Leu 

85 90 95 



Asp Pro Ala Leu Lys Glu Ala Val Gin Gly Gin Pro Asp Lys Ala Asp 

100 105 110 



Ser Pro Tyr Arg Asp Leu Tyr Thr He His Asp Gin Ala Lys Asp Tyr 
115 120 125 



Tyr Gin Tyr He Leu Leu Lys Ala Gin Val Gly Glu Val Ala Tyr Asp 
130 135 140 



Tyr Leu Gin Asn Arg Gly He Ser Arg Glu Val Met Glu Glu Phe Glu 
145 150 155 160 



Leu Gly Tyr Ser Pro Ser Gin Arg Glu Ser Leu His Leu Tyr Leu Gin 

165 170 175 



WO 03/104391 PCIYUS02/36122 

136/235 



Ser Gin Asp Gin Ala Asp Leu Thr Asp Asp Leu Leu Glu Glu Thr Gly 

180 185 190 



Leu Phe Ser Lys Arg Glu Val Glu Ser Asp Ser Phe Lys Asp Arg Phe 
195 200 205 



Ala Lys Arg lie lie Phe Pro Leu Lys Asn Leu Gin Gly Gin Thr Val 
210 215 220 



Gly Phe Ser Gly Arg Tyr Phe Gin Asp Glu Pro Asn Gin Asp Phe His 
225 230 235 240 



His Ala Lys Tyr Leu Asn Ser Pro Glu Thr Lys lie Phe Asn Lys Arg 

245 250 255 



Arg Thr Leu Phe Asn Tyr His Gin Ala Lys Ala Tyr lie Arg Arg Ala 

260 265 270 



Lys Glu Val Val Leu Phe Glu Gly Tyr Met Asp Val lie Ala Ala Trp 
275 280 285 



Gin Ala Gly Val Lys Asn Gly Leu Ala Ser Met Gly Thr Ser lie Thr 
290 295 300 



Ala Asp Gin Val Gin Thr Met Gin Arg lie Ala Asp Thr Leu Val Leu 
305 310 315 320 



Ala Phe Asp Gly Asp Glu Ala Gly Leu Glu Ser Ser Lys Lys lie Leu 

325 330 335 



Asp Asp Leu Ser Leu Thr Ser Lys Leu Gin He Glu Val Val He Phe 

340 345 350 



Pro Lys Lys Met Asp Pro Asp Glu Tyr lie Arg Glu Asn Gly Pro Glu 
355 360 365 



Ala Phe Gin Asn Leu He Gin His Gly Arg Met Thr Val Tyr Gin Phe 
370 375 380 



Leu Lys Glu Tyr Phe Lys Lys Ser Tyr Asn Leu Asp Asn Asp Ser Asp 
385 390 395 400 



Arg Leu Lys Phe He Gin Thr Met Thr Asn Lys He Gly I»ys Leu Ala 
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405 410 415 



Ser Pro Leu Glu Arg Glu Val Tyr Ala Lys Asp Leu Ala Glu Glu Phe 

420 425 430 



Asn Leu Ser Tyr Asp Thr lie lie Ser Gin Val Gin Ser Glu Ala Thr 
435 440 445 



Leu Asn Gin Gin Glu Ala Leu Lys Lys Asp Arg His Lys Glu Phe Ser 
450 455 460 



Gin Ala Arg Val Glu Val Lys Ala Pro Ser Ser Gin Lys Thr Lys He 
465 470 475 480 



Asp Arg Ala Gin Glu Lys Leu Leu Asn Arg Leu Phe Tyr Tyr Pro Gin 

485 490 495 



Val Gin Glu He He Asp Ala Tyr Asn Pro Asp Phe Glu Phe Lys Thr 

500 505 510 



Glu Val His Gin Arg He Tyr Leu Leu Phe Leu Glu Tyr Ser Gin Glu 
515 520 525 



Asn Asp Ser He Asp Ser Phe He Asp Phe Val Lys Asp Lys Glu Thr 
530 535 540 



Lys Glu Val He Ser Asp He Met Trp Thr Ser lie Glu Val Glu Pro 
545 550 555 560 



Ser Asp Glu Glu He Leu Asp Tyr Leu Asp Tyr He Asp Gin Thr Tyr 

565 570 575 



Pro Leu Glu Gin Lys Arg Gin Asp Cys Leu Glu Glu Val Lys Ala Ala 

580 585 590 



Lys Gin Ser Gly Asn Lys Lys Arg Glu Leu Glu Leu Thr Asn Gin Leu 
595 600 605 



He Glu He Asn Arg Met Leu Lys Gin 
610 615 



<210> 65 
<211> 987 
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<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (55) . . (987) 

<223> 

<400> 65 

gccagacaat cggacaacta ttaccggaca actttaagcc aggggacatg gaat atg 57 

Met 
1 



gaa aac aat gaa aac aat gaa aac aaa gat age aaa aca ttt aaa tea 
Glu Asn Asn Glu Asn Asn Glu Asn Lys Asp Ser Lys Thr Pile Lys Ser 



5 10 



15 



etc aac caa ata tta ggc cag aag att acc att ate agt gac aaa ccc 
Leu Asn Gin lie Leu Gly Gin Lys lie Thr He lie Ser Asp Lys Pro 



35 40 



45 



caa aca acc egg aat aaa ate cag ggt att tac acc gac caa gcg ggg 
Gin Thr Thr Arg Asn Lys He Gin Gly He Tyr Thr Asp Gin Ala Gly 



50 55 60 



65 



caa att gtc ttt ate gac aca cct ggt ata cat aaa ccc aag cac cgc 
Gin He Val Phe He Asp Thr Pro Gly He His Lys Pro Lys His Arg 

70 75 80 

ctg ggc egg ttt atg gtg gat teg get atg teg acc ate aat gag gtg 
Leu Gly Arg Phe Met Val Asp Ser Ala Met Ser Thr He Asn Glu Val 

85 90 95 

gac ctg gtc tta ttt gtg gtc aat gtc agg gaa aag att ggc ccg ggg 
Asp Leu Val Leu Phe Val Val Asn Val Arg Glu Lys He Gly Pro Gly 
100 105 HO 

gac egg ttc att ate gac aag ttg cga acc ate gat acg cca gtt ttt 
Asp Arg Phe lie He Asp Lys Leu Arg Thr lie Asp Thr Pro Val Phe 
115 120 125 

tta att att aac cag att gac cag gtc gat cca aca gac etc eta ccg 
Leu He He Asn Gin He Asp Gin Val Asp Pro Thr Asp Leu Leu Pro 



130 135 



140 145 



160 



act tea ggc ttg gaa ggg gaa aat ate cag gag etc att caa acc ate 
Thr Ser Gly Leu Glu Gly Glu Asn He Gin Glu Leu He Gin Thr He 

165 170 175 



105 



ggt ttt gtc acc ctt ctt ggc egg ccc aat gtg ggc aag tea acc ctg 153 
Gly Phe Val Thr Leu Leu Gly Arg Pro Asn Val Gly Lys Ser Thr Leu 
20 25 30 



201 



249 



297 



345 



393 



441 



489 



gtt att age gac tac caa gag gaa ttc gac ttt gee gaa gtg gtt cca 537 
Val He Ser Asp Tyr Gin Glu Glu Phe Asp Phe Ala Glu Val Val Pro 

150 155 



585 
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aag tct tac eta cct gtt gga ccc caa ttt tac ccg gac gac cag gtc 633 
Lys Ser Tyr Leu Pro Val Gly Pro Gin Phe Tyr Pro Asp Asp Gin Val 
180 185 190 

teg gac cac ccc gaa tac ttt att att tea gaa etc ate egg gag aag 681 
Ser Asp His Pro Glu Tyr Phe lie lie Ser Glu Leu lie Arg Glu Lys 
195 200 205 

gtt tta gac ttg get aga gaa gag att cct cat tea gta gca gta gta 729 
Val Leu Asp Leu Ala Arg Glu Glu lie Pro His Ser Val Ala Val Val 
210 215 220 225 

act gag aag gta gac cga aac caa gat ggt aaa gtc caa acc tat gec 777 
Thr Glu Lys Val Asp Arg Asn Gin Asp Gly Lys Val Gin Thr Tyr Ala 

230 235 240 

acc att att gtc gaa cgc aag age caa aag ggg att att ate ggc aag 825 
Thr He He Val Glu Arg Lys Ser Gin Lys Gly He He He Gly Lys 

245 250 255 

caa ggg tec atg att aaa aaa att ggt age eta get egg cga gat att 873 
Gin Gly Ser Met He Lys Lys He Gly Ser Leu Ala Arg Arg Asp He 
260 265 270 

gag aaa eta ctg gga gat aag att tac ttg gaa etc tgg gtt aaa gtc 921 
Glu Lys Leu Leu Gly Asp Lys He Tyr Leu Glu Leu Trp Val Lys Val 
275 280 285 

caa aga gac tgg egg gac aag ccc agt cgc tta gaa gac ttt ggc tac 969 
Gin Arg Asp Trp Arg Asp Lys Pro Ser Arg Leu Glu Asp Phe Gly Tyr 
290 295 300 305 



aat gaa gac aac tat tag 
Asn Glu Asp Asn Tyr 

310 



<210> 66 
<211> 310 
<212> PRT 

<213> Alloiococcus otitidis 



<400> 66 

Met Glu Asn Asn Glu Asn Asn Glu Asn Lys Asp Ser Lys Thr Phe Lys 
1 5 10 15 

Ser Gly Phe Val Thr Leu Leu Gly Arg Pro Asn Val Gly Lys Ser Thr 

20 25 30 

Leu Leu Asn Gin He Leu Gly Gin Lys He Thr He He Ser Asp Lys 
35 40 45 

Pro Gin Thr Thr Arg Asn Lys He Gin Gly He Tyr Thr Asp Gin Ala 
50 55 60 



987 
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Gly Gin lie Val Phe lie Asp Thr Pro Gly lie His Lys Pro Lys His 
65 70 75 80 



Arg Leu Gly Arg Phe Met Val Asp Ser Ala Met Ser Thr lie Asn Glu 

85 90 95 



Val Asp Leu Val Leu Phe Val Val Asn Val Arg Glu Lys lie Gly Pro 

100 105 110 



Gly Asp Arg Phe lie lie Asp Lys Leu Arg Thr lie Asp Thr Pro Val 
115 120 125 



Phe Leu lie lie Asn Gin lie Asp Gin Val Asp Pro Thr Asp Leu Leu 
130 135 140 



Pro Val lie Ser Asp Tyr Gin Glu Glu Phe Asp Phe Ala Glu Val Val 
145 150 155 160 



Pro Thr Ser Gly Leu Glu Gly Glu Asn He Gin Glu Leu He Gin Thr 

165 170 175 



He Lys Ser Tyr Leu Pro Val Gly Pro Gin Phe Tyr Pro Asp Asp Gin 

180 185 190 



Val Ser Asp His Pro Glu Tyr Phe He He Ser Glu Leu He Arg Glu 
195 200' 205 



Lys Val Leu Asp Leu Ala Arg Glu Glu He Pro His Ser Val Ala Val 
210 215 220 



Val Thr Glu Lys Val Asp Arg Asn Gin Asp Gly Lys Val Gin Thr Tyr 
225 230 235 240 



Ala Thr He He Val Glu Arg Lys Ser Gin Lys Gly He He He Gly 

245 250 255 



Lys Gin Gly Ser Met He Lys Lys He Gly Ser Leu Ala Arg Arg Asp 

260 265 270 



He Glu Lys Leu Leu Gly Asp Lys He Tyr Leu Glu Leu Trp Val Lys 
275 280 285 
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Val Gin Arg Asp Trp Arg Asp Lys Pro Ser Arg Leu Glu Asp Phe Gly 
290 295 300 



Tyr Asn Glu Asp Asn Tyr 
305 310 



<210> 67 
<211> 1557 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (46) . . (1557) 

<223> 

<400> 67 

catgtctatg ttactttaat cagtggaaaa caagaggaga tcatt gtg att tec tct 57 

Met lie Ser Ser 
1 

ttc tat tta gta gga gtc ttg aga ttg agt agt gaa aat aaa tta acc 105 
Phe Tyr Leu Val Gly Val Leu Arg Leu Ser Ser Glu Asn Lys Leu Thr 
5 10 15 20 

ttc aaa cac ttc ctt gca aac cag ttg acc aaa cga gac aat tta caa 153 
Phe Lys His Phe Leu Ala Asn Gin Leu Thr Lys Arg Asp Asn Leu Gin 

25 30 35 

ate ccc cgt tgg caa att ttt gec gtt tta ttt aca gga gec gtg att 201 
lie Pro Arg Trp Gin lie Phe Ala Val Leu Phe Thr Gly Ala Val lie 

40 45 50 

gtg gtt etc aac caa acg gec atg tct acc gec ttg cct aat atg att 249 
Val Val Leu Asn Gin Thr Ala Met Ser Thr Ala Leu Pro Asn Met He 
55 60 65 

gaa agt ttg ggc att gac cct age eta ggc cag tgg att gtc teg ggt 297 
Glu Ser Leu Gly He Asp Pro Ser Leu Gly Gin Trp He Val Ser Gly 
70 75 80 

tat acc ttg gtc aaa ggg att atg gtc ccc ata acc gee ttt gee atg 345 
Tyr Thr Leu Val Lys Gly He Met Val Pro He Thr Ala Phe Ala Met 
85 90 95 100 

acc aag tac egg aca egg aac ttt ttt att tta atg ttg gec etc ttc 393 
Thr Lys Tyr Arg Thr Arg Asn Phe Phe He Leu Met Leu Ala Leu Phe 

105 110 115 

tgt acc ggt agt ttt ttg act ggt ctg ggc ttt aat ttt ccg gtt gtg 441 
Cys Thr Gly Ser Phe Leu Thr Gly Leu Gly Phe Asn Phe Pro Val Val 

120 125 130 



gtc atg ggg aca gtc ate cag ggt ata gcg get ggg atg ate ate ccc 



489 



WO 03/104391 PCT/US02/36122 

142/235 

Val Met Gly Thr Val lie Gin Gly He Ala Ala Gly Met He He Pro 
135 140 145 

ttg atg cag acc gtc etc ttg acc ttg atg ccg gtt gaa age cga ggc 537 
Leu Met Gin Thr Val Leu Leu Thr Leu Met Pro Val Glu Ser Arg Gly 
150 155 160 

act get atg ggg gta atg agt ggg gtt att ggt att ggt cca gca ctg 585 
Thr Ala Met Gly Val Met Ser Gly Val He Gly He Gly Pro Ala Leu 
165 170 175 180 

ggt ccc ctt gtc ggt ggg gtc att gtt gat get ttc acc tgg gaa att 633 
Gly Pro Leu Val Gly Gly Val He Val Asp Ala Phe Thr Trp Glu He 

185 190 195 

tta ttc tac ate tgg gee tta ate acc ctt tta ttg gtt cct tta act 681 
Leu Phe Tyr He Trp Ala Leu He Thr Leu Leu Leu Val Pro Leu Thr 

200 205 210 

tgg ctg gtc tta ccc gat gta ttg cca aat gca gat tta acc att aat 729 
Trp Leu Val Leu Pro Asp Val Leu Pro Asn Ala Asp Leu Thr He Asn 
215 220 225 

tgg gec aat ate egg gac tec etc att ggt ttt ggc etc etc etc ttt 777 
Trp Ala Asn He Arg Asp Ser Leu He Gly Phe Gly Leu Leu Leu Phe 
230 235 240 

age ttg tea gtc ttt ggt tct tec ggt ttt tct teg gtc att gee tgg 825 
Ser Leu Ser Val Phe Gly Ser Ser Gly Phe Ser Ser Val He Ala Trp 
245 250 255 260 

gtc age ttg ctt ate ggt tta gtc ttt gtc gee aag ttt ate cac ttc 873 
Val Ser Leu Leu He Gly Leu Val Phe Val Ala Lys Phe He His Phe 

265 270 275 

aac etc aag gca gac caa cca ate tta aat ctt aga etc ttt aaa aaa 921 
Asn Leu Lys Ala Asp Gin Pro He Leu Asn Leu Arg Leu Phe Lys Lys 

280 285 290 

acc tat tac cgt egg get gtc ttg gta gec acc ttg ggg att gtc att 969 
Thr Tyr Tyr Arg Arg Ala Val Leu Val Ala Thr Leu Gly He Val He 
295 300 305 

att tct tgt eta tec aac att ate cct att tat gtt caa act gtt agg 1017 
He Ser Cys Leu Ser Asn He He Pro He Tyr Val Gin Thr Val Arg 
310 315 320 

ggc ttg ggg get tec ata gca ggc tta ate tta atg cca get ggt ate 1065 
Gly Leu Gly Ala Ser He Ala Gly Leu He Leu Met Pro Ala Gly He 
325 330 335 340 

ate aaa acc ate tta get cct ate tea ggc aaa ctt tat gac aag gtt 1113 
He Lys Thr He Leu Ala Pro He Ser Gly Lys Leu Tyr Asp Lys Val 

345 350 355 



gga gtg get egg att ggc ctt ate ggt ggt ate tta ctt tta gtt ggg 
Gly Val Ala Arg He Gly Leu He Gly Gly He Leu Leu Leu Val Gly 



1161 
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360 365 370 

tec tta tta eta gtt ace etc aat gaa get age tec ctt tac tta ctg 1209 
Ser Leu Leu Leu Val Thr Leu Asn Glu Ala Ser Ser Leu Tyr Leu Leu 
375 380 385 

atg att tac tac ggc ate tta tea gec ggt ttt ggc ttg ttt aat ate 1257 
Met lie Tyr Tyr Gly lie Leu Ser Ala Gly Phe Gly Leu Phe Asn lie 
390 395 400 

cct att acc act get ggc atg aat att atg gec aag gaa gat atg gga 1305 
Pro lie Thr Thr Ala Gly Met Asn lie Met Ala Lys Glu Asp Met Gly 
405 410 415 420 

cat gcg act tea gec egg caa acg gtc egg caa ate tct tea agt ttt 1353 
His Ala Thr Ser Ala Arg Gin Thr Val Arg Gin lie Ser Ser Ser Phe 

425 430 435 

gec gtt tec etc tec ttt ate ate atg acc ctg gtg act att gee act 1401 
Ala Val Ser Leu Ser Phe lie lie Met Thr Leu Val Thr lie Ala Thr 

440 445 450 

tec ggc caa teg gtg ggg gtt ttc caa gat ggc ggt ccg aca gac tta 1449 
Ser Gly Gin Ser Val Gly Val Phe Gin Asp Gly Gly Pro Thr Asp Leu 
455 460 465 

aat atg gca gga gtc cga ggc gec ttt ate ttg gtg get ata ttt tea 1497 
Asn Met Ala Gly Val Arg Gly Ala Phe lie Leu Val Ala lie Phe Ser 
470 475 480 

ate eta gec atg ate ttg ate ttc ttt tta aaa gac cct aaa gaa aaa 1545 
lie Leu Ala Met lie Leu lie Phe Phe Leu Lys Asp Pro Lys Glu Lys 
485 490 495 500 

cca gac caa tag 1557 
Pro Asp Gin 

<210> 68 
<211> 503 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 68 

Met He Ser Ser Phe Tyr Leu Val Gly Val Leu Arg Leu Ser Ser Glu 
15 10 15 

Asn Lys Leu Thr Phe Lys His Phe Leu Ala Asn Gin Leu Thr Lys Arg 

20 25 30 

Asp Asn Leu Gin He Pro Arg Trp Gin He Phe Ala Val Leu Phe Thr 
35 40 45 



Gly Ala Val He Val Val Leu Asn Gin Thr Ala Met Ser Thr Ala Leu 
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50 55 60 

Pro Asn Met He Glu Ser Leu Gly lie Asp Pro Ser Leu Gly Gin Trp 
65 70 75 80 

lie val Ser Gly Tyr Thr Leu Val Lys Gly lie Met Val Pro lie Thr 

85 90 95 

Ala Phe Ala Met Thr Lys Tyr Arg Thr Arg Asn Phe Phe lie Leu Met 

100 105 110 

Leu Ala Leu Phe Cys Thr Gly Ser Phe Leu Thr Gly Leu Gly Phe Asn 
115 120 125 

Phe Pro Val Val Val Met Gly Thr Val lie Gin Gly lie Ala Ala Gly 
130 135 140 

Met lie lie Pro Leu Met Gin Thr Val Leu Leu Thr Leu Met Pro Val 
I 45 150 155 160 

Glu Ser Arg Gly Thr Ala Met Gly Val Met Ser Gly Val lie Gly lie 

165 170 175 

Gly Pro Ala Leu Gly Pro Leu Val Gly Gly Val lie Val Asp Ala Phe 

180 185 190 

Thr Trp Glu He Leu Phe Tyr He Trp Ala Leu He Thr Leu Leu Leu 
195 200 205 

Val Pro Leu Thr Trp Leu Val Leu Pro Asp Val Leu Pro Asn Ala Asp 
210 215 220 

Leu Thr He Asn Trp Ala Asn He Arg Asp Ser Leu He Gly Phe Gly 
225 230 235 240 

Leu Leu Leu Phe Ser Leu Ser Val Phe Gly Ser Ser Gly Phe Ser Ser 

245 250 255 

Val He Ala Trp Val Ser Leu Leu He Gly Leu Val Phe Val Ala Lys 

260 265 270 



Phe He His Phe Asn Leu Lys Ala Asp Gin Pro He Leu Asn Leu Arg 
275 280 285 
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Leu Phe Lys Lys Thr Tyr Tyr Arg Arg Ala Val Leu Val Ala Thr Leu 
290 295 300 



Gly lie Val lie He Ser Cys Leu Ser Asn He lie Pro He Tyr Val 
305 310 315 320 



Gin Thr Val Arg Gly Leu Gly Ala Ser He Ala Gly Leu He Leu Met 

325 330 335 



Pro Ala Gly He He Lys Thr He Leu Ala Pro He Ser Gly Lys Leu 

340 345 350 



Tyr Asp Lys Val Gly Val Ala Arg He Gly Leu He Gly Gly He Leu 
355 360 365 



Leu Leu Val Gly Ser Leu Leu Leu Val Thr Leu Asn Glu Ala Ser Ser 
370 375 380 



Leu Tyr Leu Leu Met He Tyr Tyr Gly He Leu Ser Ala Gly Phe Gly 
385 390 395 400 



Leu Phe Asn He Pro He Thr Thr Ala Gly Met Asn He Met Ala Lys 

405 410 415 



Glu Asp Met Gly His Ala Thr Ser Ala Arg Gin Thr Val Arg Gin He 

420 425 430 



Ser Ser Ser Phe Ala Val Ser Leu Ser Phe He He Met Thr Leu Val 
435 440 445 



Thr lie Ala Thr Ser Gly Gin Ser Val Gly Val Phe Gin Asp Gly Gly 
450 455 460 



Pro Thr Asp Leu Asn Met Ala Gly Val Arg Gly Ala Phe He Leu Val 
465 470 475 480 



Ala He Phe Ser He Leu Ala Met He Leu He Phe Phe Leu Lys Asp 

485 490 495 



Pro Lys Glu Lys Pro Asp Gin 

500 
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<210> 69 
<211> 4392 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 
<221> CDS 

<222> (58) . . (4392) 
<223> 

<400> 69 

aagggttcag gactttcgta tctggccctt tttggctcat tagaaagcag ggcaaag 57 



atg tea etc 
Met Ser Leu 
1 

cac tta gaa 
His Leu Glu 



ttg aag caa 
Leu Lys Gin 
35 

etc caa ttt 
Leu Gin Phe 
50 

tct gec etc 
Ser Ala Leu 
65 

gtt gat gec 
Val Asp Ala 



tgg cct aag 
Trp Pro Lys 



gac tta eta 
Asp Leu Leu 
115 

ttt gac ctg 
Phe Asp Leu 
13 0 

eta cct egg 
Leu Pro Arg 
145 

ttt aaa ate 



aat caa aaa 
Asn Gin Lys 
5 

gaa cac eta 
Glu His Leu 
20 

att gtt gtt 
He Val Val 



cct cag ate 
Pro Gin He 



ttg cag cat 
Leu Gin His 
70 

caa gat gac 
Gin Asp Asp 
85 

gcg gtg aag 
Ala Val Lys 
100 

gac aag acc 
Asp Lys Thr 



gac cat gaa 
Asp His Glu 



ate caa get 
He Gin Ala 
150 

aag get agg 



gaa atg tat 
Glu Met Tyr 



caa gac cga 
Gin Asp Axg 
25 

tac aag get 
Tyr Lys Ala 
40 

etc cct ttt 
Leu Pro Phe 
55 

ate cca gaa 
lie Pro Glu 



agt ttt gac 
Ser Phe Asp 



ttt age gga 
Phe Ser Gly 
105 

etc cct tat 
Leu Pro Ty r 
120 

gtg acc egg 
Val Thr Arg 
135 

ggc tac cag 
Gly Tyr Gin 



gtc gat gec 



caa gta ttg 
Gin Val Leu 
10 

ccc ctt ctt 
Pro Leu Leu 



caa caa gec 
Gin Gin Ala 



aag gac ttc 
Lys Asp Phe 
60 

gtc aac cag 
Val Asn Gin 
75 

cag gac etc 
Gin Asp Leu 
90 

gtc gat tct 
Val Asp Ser 



eta gat ggg 
Leu Asp Gly 



gac aag ttt 
Asp Lys Phe 
140 

caa gtg ggc 
Gin Val Gly 
155 

cag aaa aat 



atg cag caa 
Met Gin Gin 
15 

aaa gee ggc 
Lys Ala Gly 
30 

tgg gac ctg 
Trp Asp Leu 
45 

caa gtt ttg 
Gin Val Leu 



ate cat tta 
He His Leu 



etc cag gac 
Leu Gin Asp 
95 

ccc ctt tgc 
Pro Leu Cys 
110 

aag caa gtt 
Lys Gin Val 
125 

gac cat gac 
Asp His Asp 



ttt ccc aac 
Phe Pro Asn 



tea gat caa 



gtc 105 
Val 



agt 153 
Ser 



acc 2 01 

Thr 



gag 249 
Glu 



agg 297 

Arg 

80 

tat 345 
Tyr 



aat 393 
Asn 



tac 441 
Tyr 



ttc 489 
Phe 



cac 537 

His 

160 

att 585 
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Phe Lys lie Lys Ala Arg Val Asp Ala Gin Lys Asn Ser Asp Gin lie 

165 170 175 

gcc gcc ttc cgt aaa gaa aaa gaa gaa aaa gac cag gcc ttg tct caa 
Ala Ala Phe Arg Lys Glu Lys Glu Glu Lys Asp Gin Ala Leu Ser Gin 

L80 185 190 

gag eta acc aac caa ttt ate aag gcc age caa aag aaa gaa gaa ggg 
Glu Leu Thr Asn Gin Phe He Lys Ala Ser Gin Lys Lys Glu Glu Gly 
195 200 205 

gga tec aaa gcc aag teg gag gcc ttg aag atg ggc egg gcc ate cct 
Gly Ser Lys Ala Lys Ser Glu Ala Leu Lys Met Gly Arg Ala He Pro 
210 215 220 



cgt ctg acc ttt gaa gga tac gtt ttt gat gtg gaa ate aaa tec etc 
Arg Leu Thr Phe Glu Gly Tyr Val Phe Asp Val Glu He Lys Ser Leu 

245 250 255 

egg tea gat aga aag etc ctt etc ttt aaa atg acc gac tat age tct 
Arg Ser Asp Arg Lys Leu Leu Leu Phe Lys Met Thr Asp Tyr Ser Ser 

260 265 270 

tec ttc eta ttc aaa aaa ttc tct aat aat tct tct gac gaa gcc eta 
Ser Phe Leu Phe Lys Lys Phe Ser Asn Asn Ser Ser Asp Glu Ala Leu 
275 280 285 

ttt gac caa gtc caa gag gga atg tgg etc aag gtt aga ggc agt gtt 
Phe Asp Gin Val Gin Glu Gly Met Trp Leu Lys Val Arg Gly Ser Val 
290 295 300 

caa gaa gat acc ttt gtc aaa gac eta gtt gtc atg gcc caa gac ate 
Gin Glu Asp Thr Phe Val Lys Asp Leu Val Val Met Ala Gin Asp lie 
305 310 315 320 

caa gag gtc aaa aaa gaa ccc egg egg gac ctg get aag gaa ggg gag 
Gin Glu Val Lys Lys Glu Pro Arg Arg Asp Leu Ala Lys Glu Gly Glu 

325 330 335 



ttg gtg ccg gcc aag gat ttg gtc aag caa gca gcc get ttt gac caa 
Leu Val Pro Ala Lys Asp Leu Val Lys Gin Ala Ala Ala Phe Asp Gin 
355 360 365 

ccg get att gcc ate act gat cat get gta gtc caa tec ttc cca gag 
Pro Ala He Ala He Thr Asp His Ala Val Val Gin Ser Phe Pro Glu 
370 375 380 



633 



681 



729 



gac cac gaa acg att acc cag atg gtt gat gtg gaa gaa gaa gag age 777 
Asp His Glu Thr lie Thr Gin Met Val Asp Val Glu Glu Glu Glu Ser 
225 230 235 240 



825 



873 



921 



969 



1017 



.1065 



aag agg gtg gaa ctt cat gcc cat acc acc atg agt cag atg gac ggt 1113 
Lys Arg Val Glu Leu His Ala His Thr Thr Met Ser Gin Met Asp Gly 

340 345 350 



1161 



1209 



gcc cat tat get ggc tta gac act ggt gtt aaa att ctt tac ggt gtg 1257 
Ala His Tyr Ala Gly Leu Asp Thr Gly Val Lys He Leu Tyr Gly Val 
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385 390 395 400 

gaa gcc aat ttg gtt agt gat ggc gaa ttg gta gca tac aat ccg gcc 1305 
Glu Ala Asn Leu Val Ser Asp Gly Glu Leu Val Ala Tyr Asn Pro Ala 

405 410 415 

gat ata aag ctg gaa gag gca act tat gtg gtc ttc gac gtg gaa aca 1353 
Asp lie Lys Leu Glu Glu Ala Thr Tyr Val Val Phe Asp Val Glu Thr 

420 425 430 

acc gga eta teg get cgt tat gac caa ate att gaa ttg gcc get gtg 1401 
Thr Gly Leu Ser Ala Arg Tyr Asp Gin lie lie Glu Leu Ala Ala Val 
435 440 445 

aag atg gaa aat ggg gaa ate gtt tct gaa ttc caa gaa ttt att gac 1449 
Lys Met Glu Asn Gly Glu lie Val Ser Glu Phe Gin Glu Phe lie Asp 
450 455 460 

cca ggc cag ccc ttg tct gag act acg acc aat ttg acc ggg ate acc 1497 
Pro Gly Gin Pro Leu Ser Glu Thr Thr Thr Asn Leu Thr Gly lie Thr 
465 470 475 480 

gat gac atg gtc caa gga tec aaa agt gaa gac gaa gtc etc cat gcc 1545 
Asp Asp Met Val Gin Gly Ser Lys Ser Glu Asp Glu Val Leu His Ala 

485 490 495 

ttt caa gcc ttt tea gaa ggc act gtc ttg gtc gcc cat aac get tec 1593 
Phe Gin Ala Phe Ser Glu Gly Thr Val Leu Val Ala His Asn Ala Ser 

500 505 510 

ttt gac atg ggc ttt ate aat acg gcc tac caa cga cat ggc eta gga 1641 
Phe Asp Met Gly Phe lie Asn Thr Ala Tyr Gin Arg His Gly Leu Gly 
515 520 525 

caa get gac cag cct gtg att gat acc ttg gaa ttg tec cgc atg etc 1689 
Gin Ala Asp Gin Pro Val lie Asp Thr Leu Glu Leu Ser Arg Met Leu 
530 535 540 

cac cca aac ttg aaa age cac egg tta aac act ctg get aag egg tat 1737 
His Pro Asn Leu Lys Ser His Arg Leu Asn Thr Leu Ala Lys Arg Tyr 
545 550 555 560 

gac gtg gcc tta gaa cac cac cac egg gcc ate tat gac teg gag tea 1785 
Asp Val Ala Leu Glu His His His Arg Ala lie Tyr Asp Ser Glu Ser 

565 570 575 

acg get aaa etc ttg tgg ate ttc tta aaa gaa gcc aaa gac caa tat 1833 
Thr Ala Lys Leu Leu Trp lie Phe Leu Lys Glu Ala Lys Asp Gin Tyr 

580 585 590 

gac atg act age cac caa gac ttg aat age cag gtg ggg gaa ggc gag 1881 
Asp Met Thr Ser His Gin Asp Leu Asn Ser Gin Val Gly Glu Gly Glu 
595 600 605 

get tac aag cag gcc egg cca acc cat gcc agt att ttg gtc aag aat 1929 
Ala Tyr Lys Gin Ala Arg Pro Thr His Ala Ser lie Leu Val Lys Asn 
610 615 620 
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caa aaa ggc ttg aaa aac etc ttt aaa att gtc tec cac gee cat gtc 1977 
Gin Lys Gly Leu Lys Asn Leu Phe Lys He Val Ser His Ala His Val 
625 630 635 640 

aac tac ttc tac egg gtt ccc cgt ata cct aag tct ate ttg age aag 2025 
Asn Tyr Phe Tyr Arg Val Pro Arg He Pro Lys Ser He Leu Ser Lys 

645 650 655 

tac egg gaa ggc ctt ttg gtt ggg tct ggt tgc gga cag gga gag etc 2073 
Tyr Arg Glu Gly Leu Leu Val Gly Ser Gly Cys Gly Gin Gly Glu Leu 

660 665 670 

ttt gag get att atg caa aag ggc tat gac gaa gee ttg gca gtt gec 2121 
Phe Glu Ala He Met Gin Lys Gly Tyr Asp Glu Ala Leu Ala Val Ala 
675 680 685 

cag gac tat gat tat att gaa gtt atg ccc aag tea gee tat att gac 2169 
Gin Asp Tyr Asp Tyr He Glu Val Met Pro Lys Ser Ala Tyr He Asp 
690 695 700 

etc ttg gac egg gac tta ate aag gat gag gca ace ctt gaa gaa atg 2217 
Leu Leu Asp Arg Asp Leu He Lys Asp Glu Ala Thx Leu Glu Glu Met 
705 710 715 720 

att gaa aac ctg gtt aaa ata ggc cat gaa ctt gat ata ccc gtg gta 2265 
He Glu Asn Leu Val Lys He Gly His Glu Leu Asp He Pro Val Val 

725 730 735 

get aca ggg aat gtc cac tac eta aac cca gaa gat gec gtt tta egg 2313 
Ala Thr Gly Asn Val His Tyr Leu Asn Pro Glu Asp Ala Val Leu Arg 

740 745 750 

gat ate etc ctg gaa act gee aaa aag gga gec ttc tec aaa gec egg 2361 
Asp He Leu Leu Glu Thr Ala Lys Lys Gly Ala Phe Ser Lys Ala Arg 
755 760 765 

aac cca gaa gtc cac ttt aga aca aca gat gaa atg tta gaa gag ttt 2409 
Asn Pro Glu Val His Phe Arg Thr Thr Asp Glu Met Leu Glu Glu Phe 
770 775 780 

tec ttc eta ggc cag gac cag get tat gag att gtg gtc ace aac ace 2457 
Ser Phe Leu Gly Gin Asp Gin Ala Tyr Glu He Val Val Thr Asn Thr 
785 790 795 800 

caa aaa att get gat tct ate gaa tea ate tct cct gtc aag gaa ggc 2505 
Gin Lys He Ala Asp Ser lie Glu Ser He Ser Pro Val Lys Glu Gly 

805 810 815 

etc tat gec ccg aaa atg gaa ggg teg gac caa gag ata cgt cag atg 2553 
Leu Tyr Ala Pro Lys Met Glu Gly Ser Asp Gin Glu He Arg Gin Met 

820 825 830 

agt tac aag caa gee aag get etc tat ggc gac ccc ttg cca agt att 2601 
Ser Tyr Lys Gin Ala Lys Ala Leu Tyr Gly Asp Pro Leu Pro Ser He 
835 840 845 
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gta gag gaa agg etc gaa aaa gag ttg aag agt att att gac aac aat 2649 
Val Glu Glu Arg Leu Glu Lys Glu Leu Lys Ser lie lie Asp Asn Asn 
850 855 860 

ttc tct gtc att tac tta att tec cag aaa ttg gtc aaa aaa agt gtt 2697 
Phe Ser Val lie Tyr Leu lie Ser Gin Lys Leu Val Lys Lys Ser Val 
865 870 875 880 

gaa gat ggc tat ttg gtt ggt tec agg ggg teg gtt ggg tea age ttt 2745 
Glu Asp Gly Tyr Leu Val Gly Ser Arg Gly Ser Val Gly Ser Ser Phe 

885 890 895 

gtg gec acc atg acc ggg ate aca gaa gtc aac cca eta ccg ccc cac 2793 
Val Ala Thr Met Thr Gly He Thr Glu Val Asn Pro Leu Pro Pro His 

900 905 910 

tac cgc tgt cct aac tgc cag cac acc gaa ttc ttc aca aat ggg gaa 2841 
Tyr Arg Cys Pro Asn Cys Gin His Thr Glu Phe Phe Thr Asn Gly Glu 
915 920 925 

,9"tg ggg tec ggc ttt gac tta gag gec aaa aaa tgt ccg gaa tgt caa 2889 
Val Gly Ser Gly Phe Asp Leu Glu Ala Lys Lys Cys Pro Glu Cys Gin 
930 935 940 

age eta atg gaa tea gac ggc cac gac att ccc ttc gaa acc ttc ctt 2937 
Ser Leu Met Glu Ser Asp Gly His Asp He Pro Phe Glu Thr Phe Leu 
945 950 955 960 

ggt ttt aat ggg gac aag gtg cca gat ate gat ttg aac ttc tea ggt 2985 
Gly Phe Asn Gly Asp Lys Val Pro Asp He Asp Leu Asn Phe Ser Gly 

965 970 975 

gaa tac cag gec aag gec cac aac tat acc aag gtt ttg ttt gga gaa 3033 
Glu Tyr Gin Ala Lys Ala His Asn Tyr Thr Lys Val Leu Phe Gly Glu 

980 985 990 

gac cat gtc tac egg gca ggg acc ate acg acg att get gac aag acg 3 081 

Asp His Val Tyr Arg Ala Gly Thr He Thr Thr He Ala Asp Lys Thr 
995 1000 1005 

gee ttt ggt ttt gtc aag ggt tat gaa agg gac aag cag ata aac tac 3129 
Ala Phe Gly Phe Val Lys Gly Tyr Glu Arg Asp Lys Gin He Asn Tyr 
1010 1015 1020 

egg teg get gaa gtg gac egg ctg tea gat ggt tta acc gga gtg aga 3177 
Arg Ser Ala Glu Val Asp Arg Leu Ser Asp Gly Leu Thr Gly Val Arg 
1025 1030 1035 1040 

egg tea acc ggc cag cac cca gga ggg att ate gtc ata ccg gat gac 3225 
Arg Ser Thr Gly Gin His Pro Gly Gly He lie Val He Pro Asp Asp 

1045 1050 1055 

atg gat gtg ttt gat ttc acc ccc ate cag tac ccg get gac gac cag 3273 
Met Asp Val Phe Asp Phe Thr Pro He Gin Tyr Pro Ala Asp Asp Gin 

1060 1065 1070 



acg get gag tgg caa act acc cac ttt gac ttc cac tec ate gac gaa 



3321 
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Thr Ala Glu Trp Gin Thr Thr His Phe Asp Phe His Ser He Asp Glu 
1075 1080 1085 

aac gtc ttg aag ctg gat ate ctg gga cat gat gac ccg acc atg ate 33 69 

Asn Val Leu Lys Leu Asp He Leu Gly His Asp Asp Pro Thr Met He 
1090 1095 1100 

cga aaa etc cag gac ttg tec ggc ttt gac cct caa gaa ata ccg gta 3417 
Arg Lys Leu Gin Asp Leu Ser Gly Phe Asp Pro Gin Glu He Pro Val 
1105 1110 1115 1120 

agt gat gaa gat gtt atg aaa att ttc tea ggc ccg gaa gtt eta ggg 3465 
Ser Asp Glu Asp Val Met Lys He Phe Ser Gly Pro Glu Val Leu Gly 

1125 1130 1135 

gtg acc cca gag caa att ttc tec aat acc gga act etc gga gta cct 3513 
Val Thr Pro Glu Gin He Phe Ser Asn Thr Gly Thr Leu Gly Val Pro 

1140 1145 1150 

gaa ttt ggt acc caa ttt gtc cga . gaa atg tta gag caa acc cac ccc 3561 
Glu Phe Gly Thr Gin Phe Val Arg Glu Met Leu Glu Gin Thr His Pro 
1155 1160 1165 

tct acc ttt get gaa etc ttg cag ate tea ggc etc tec cac ggg aca 3 609 

Ser Thr Phe Ala Glu Leu Leu Gin He Ser Gly Leu Ser His Gly Thr 
1170 1175 1180 

gat gtt tgg ctg ggc aat get gaa gaa tta att cgc aac cac aac att 3 657 

Asp Val Trp Leu Gly Asn Ala Glu Glu Leu He Arg Asn His Asn He 
H85 1190 1195 1200 

ccc ttg tec gag gtg ate ggc tgc egg gat gat ate atg gtc tac ctt 3705 
Pro Leu Ser Glu Val He Gly Cys Arg Asp Asp He Met Val Tyr Leu 

1205 1210 1215 

caa cac caa ggt ctt gaa gac age ctg gee ttt aag att atg gaa ttt 3753 
Gin His Gin Gly Leu Glu Asp Ser Leu Ala Phe Lys He Met Glu Phe 

1220 1225 1230 



gtt cgt aag ggt egg ggc ttg caa gat gac tgg att get acc atg aaa 
Val Arg Lys Gly Arg Gly Leu Gin Asp Asp Trp He Ala Thr Met Lys 
1235 1240 1245 



agg gta get tac ttt aaa gtc cac tac ccc ctt tac tac tac get gec 
Arg Val Ala Tyr Phe Lys Val His Tyr Pro Leu Tyr Tyr Tyr Ala Ala 

1285 1290 1295 



3801 



gaa aat gat gtt cct gat tgg tat att gaa tec tgc aaa aaa ate aag 3849 
Glu Asn Asp Val Pro Asp Trp Tyr He Glu Ser Cys Lys Lys He Lys 
1250 1255 1260 

tac atg ttc cct aaa gee cac gca get gee tat gtc ttg atg gec ctt 3897 
Tyr Met Phe Pro Lys Ala His Ala Ala Ala Tyr Val Leu Met Ala Leu 
1265 1270 1275 1280 



3945 



tac ttt tec ate egg get agt gat ttt gac tta att get atg gtc aag 
Tyr Phe Ser He Arg Ala Ser Asp Phe Asp Leu He Ala Met Val Lys 



3993 
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1300 1305 1310 

ggc aag gaa ggc att aaa ggg get atg aag gaa ate agg gac aag gaa 4041 

Gly Lys Glu Gly lie Lys Gly Ala Met Lys Glu lie Arg Asp Lys Glu 

1315 1320 1325 

aga gaa aaa act gec aca get aag gac aaa gee ttg etc acc gtc ctt 4089 

Arg Glu Lys Thr Ala Thr Ala Lys Asp Lys Ala Leu Leu Thr Val Leu 

1330 1335 1340 

gaa gta gec aat gaa atg gtt gaa egg ggt ttt gac ttc aag atg gtg 4137 
Glu Val Ala Asn Glu Met Val Glu Arg Gly Phe Asp Phe Lys Met Val 

1345 1350 1355 1360 

gac ate aac aag tec caa gee aaa gac ttt gtc ate gaa gac aat ggc 4185 

Asp lie Asn Lys Ser Gin Ala Lys Asp Phe Val lie Glu Asp Asn Gly 

1365 1370 1375 

ctt cgt get cca ttt agg gca gtc cct tec ttg ggg tec agt gee gee 4233 

Leu Arg Ala Pro Phe Arg Ala Val Pro Ser Leu Gly Ser Ser Ala Ala 

1380 1385 1390 

cag get gtc att gat gec agg gag gac age gac ttc ttg tec aag gaa 42 81 

Gin Ala Val lie Asp Ala Arg Glu Asp Ser Asp Phe Leu Ser Lys Glu 

1395 1400 1405 

gac eta tea aaa egg ggc aag ttg teg aaa acg gtc atg gac tac ctg 4329 

Asp Leu Ser Lys Arg Gly Lys Leu Ser Lys Thr Val Met Asp Tyr Leu 

1410 1415 1420 

gac aat aac cac gtt tta gac cac ctg ccg gac gaa aac caa ctt tec 4377 

Asp Asn Asn His Val Leu Asp His Leu Pro Asp Glu Asn Gin Leu Ser 

1425 1430 1435 1440 

etc ttt gac ttt taa 4392 
Leu Phe Asp Phe 



<210> 70 
<211> 1444 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 70 

Met Ser Leu Asn Gin Lys Glu Met Tyr Gin Val Leu Met Gin Gin Val 
1 . . . 5 10 15 



His Leu Glu Glu His Leu Gin Asp Arg Pro Leu Leu Lys Ala Gly Ser 

20 25 30 



Leu Lys Gin He Val Val Tyr Lys Ala Gin Gin Ala Trp Asp Leu Thr 
35 40 45 



Leu Gin Phe Pro Gin He Leu Pro Phe Lys Asp Phe Gin Val Leu Glu 
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50 55 60 



Ser Ala Leu Leu Gin His He Pro Glu Val Asn Gin lie His Leu Arg 
65 70 75 80 



Val Asp Ala Gin Asp Asp Ser Phe Asp Gin Asp Leu Leu Gin Asp Tyr 

85 90 95 



Trp Pro Lys Ala Val Lys Phe Ser Gly Val Asp Ser Pro Leu Cys Asn 

100 105 110 



Asp Leu Leu Asp Lys Thr Leu Pro Tyr Leu Asp Gly Lys Gin Val Tyr 
115 120 125 



Phe Asp Leu Asp His Glu Val Thr Arg Asp Lys Phe Asp His Asp Phe 
130 " 135 140 



Leu Pro Arg He Gin Ala Gly Tyr Gin Gin Val Gly Phe Pro Asn His 
145 150 155 160 



Phe Lys He Lys Ala Arg Val Asp Ala Gin Lys Asn Ser Asp Gin He 

165 170 175 



Ala Ala Phe Arg Lys Glu Lys Glu Glu Lys Asp Gin Ala Leu Ser Gin 

180 185 190 



Glu Leu Thr Asn Gin Phe He Lys Ala Ser Gin Lys Lys Glu Glu Gly 
195 200 205 



Gly Ser Lys Ala Lys Ser Glu Ala Leu Lys Met Gly Arg Ala He Pro 
210 215 220 



Asp His Glu Thr He Thr Gin Met Val Asp Val Glu Glu Glu Glu Ser 
225 230 235 240 



Arg Leu Thr Phe Glu Gly Tyr Val Phe Asp Val Glu He Lys Ser Leu 

245 250 255 



Arg Ser Asp Arg Lys Leu Leu Leu Phe Lys Met Thr Asp Tyr Ser Ser 

260 265 270 



Ser Phe Leu Phe Lys Lys Phe Ser Asn Asn Ser Ser Asp Glu Ala Leu 
275 280 285 
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Phe Asp Gin Val Gin Glu Gly Met Trp Leu Lys Val Arg Gly Ser Val 
290 295 300 

Gin Glu Asp Thr Phe Val Lys Asp Leu Val Val Met Ala Gin Asp lie 
305 310 315 320 

Gin Glu Val Lys Lys Glu Pro Arg Arg Asp Leu Ala Lys Glu Gly Glu 

325 330 335 



Lys Arg Val Glu Leu His Ala His Thr Thr Met Ser Gin Met Asp Gly 

340 345 350 



Leu Val Pro Ala Lys Asp Leu Val Lys Gin Ala Ala Ala Phe Asp Gin 
355 360 365 



Pro Ala lie Ala lie Thr Asp His Ala Val Val Gin Ser Phe Pro Glu 
370 375 380 



Ala His Tyr Ala Gly Leu Asp Thr Gly Val Lys He Leu Tyr Gly Val 
385 390 395 400 



Glu Ala Asn Leu Val Ser Asp Gly Glu Leu Val Ala Tyr Asn Pro Ala 

405 410 415 



Asp He Lys Leu Glu Glu Ala Thr Tyr Val Val Phe Asp Val Glu Thr 

420 425 430 



Thr Gly Leu Ser Ala Arg Tyr Asp Gin He He Glu Leu Ala Ala Val 
435 440 445 



Lys Met Glu Asn Gly Glu He Val Ser Glu Phe Gin Glu Phe He Asp 
450 455 460 



Pro Gly Gin Pro Leu Ser Glu Thr Thr Thr Asn Leu Thr Gly He Thr 
465 470 475 480 



Asp Asp Met Val Gin Gly Ser Lys Ser Glu Asp Glu Val Leu His Ala 

485 490 495 



Phe Gin Ala Phe Ser Glu Gly Thr Val Leu Val Ala His Asn Ala Ser 

500 505 510 
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Phe Asp Met Gly Phe lie Asn Thr Ala Tyr Gin Arg His Gly Leu Gly 
515 520 525 



Gin Ala Asp Gin Pro Val lie Asp Thr Leu Glu Leu Ser Arg Met Leu 
530 535 540 



His Pro Asn Leu Lys Ser His Arg Leu Asn Thr Leu Ala Lys Arg Tyr 
545 550 555 560 

Asp Val Ala Leu Glu His His His Arg Ala He Tyr Asp Ser Glu Ser 

565 570 575 



Thr Ala Lys Leu Leu Trp lie Phe Leu Lys Glu Ala Lys Asp Gin Tyr 

580 585 590 

Asp Met Thr Ser His Gin Asp Leu Asn Ser Gin Val Gly Glu Gly Glu 
595 600 605 



Ala Tyr Lys Gin Ala Arg Pro Thr His Ala Ser He Leu Val Lys Asn 
610 615 620 



Gin Lys Gly Leu Lys Asn Leu Phe Lys He Val Ser His Ala His Val 
625 630 635 640 



Asn Tyr Phe Tyr Arg Val Pro Arg He Pro Lys Ser He Leu Ser Lys 

645 650 655 



Tyr Arg Glu Gly Leu Leu Val Gly Ser Gly Cys Gly Gin Gly Glu Leu 

660 665 670 



Phe Glu Ala He Met Gin Lys Gly Tyr Asp Glu Ala Leu Ala Val Ala 
675 680 685 



Gin Asp Tyr Asp Tyr He Glu Val Met Pro Lys Ser Ala Tyr He Asp 
690 695 700 



Leu Leu Asp Arg Asp Leu He Lys Asp Glu Ala Thr Leu Glu Glu Met 
705 710 715 720 



He Glu Asn Leu Val Lys He Gly His Glu Leu Asp He Pro Val Val 

725 730 735 
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Ala Thr Gly Asn Val His Tyr Leu Asn Pro Glu Asp Ala Val Leu Arg 

740 745 750 



Asp lie Leu Leu Glu Thr Ala Lys Lys Gly Ala Phe Ser Lys Ala Arg 
755 760 765 



Asn Pro Glu Val His Phe Arg Thr Thr Asp Glu Met Leu Glu Glu Phe 
770 775 780 



Ser Phe Leu Gly Gin Asp Gin Ala Tyr Glu lie Val Val Thr Asn Thr 
785 790 795 800 

Gin Lys lie Ala Asp Ser lie Glu Ser lie Ser Pro Val Lys Glu Gly 

805 810 815 



Leu Tyr Ala Pro Lys Met Glu Gly Ser Asp Gin Glu lie Arg Gin Met 

820 825 830 

Ser Tyr Lys Gin Ala Lys Ala Leu Tyr Gly Asp Pro Leu Pro Ser He 
835 840 845 



Val Glu Glu Arg Leu Glu Lys Glu Leu Lys Ser He He Asp Asn Asn 
850 855 860 



Phe Ser Val He Tyr Leu He Ser Gin Lys Leu Val Lys Lys Ser Val 

865 870 875 880 

Glu Asp Gly Tyr Leu Val Gly Ser Arg Gly Ser Val Gly Ser Ser Phe 

885 890 895 



Val Ala Thr Met Thr Gly He Thr Glu Val Asn Pro Leu Pro Pro His 

900 905 910 



Tyr Arg Cys Pro Asn Cys Gin His Thr Glu Phe Phe Thr Asn Gly Glu 
915 920 925 



Val Gly Ser Gly Phe Asp Leu Glu Ala Lys Lys Cys Pro Glu Cys Gin 
930 935 940 



Ser Leu Met Glu Ser Asp Gly His Asp He Pro Phe Glu Thr Phe Leu 
945 950 955 960 



Gly Phe Asn Gly Asp Lys Val Pro Asp He Asp Leu Asn Phe Ser Gly 



WO 03/104391 



157/235 



PCT/US02/36122 



965 970 975 



Glu Tyr Gin Ala Lys Ala His Asn Tyr Thr Lys Val Leu Phe Gly Glu 

980 985 990 



Asp His Val Tyr Arg Ala Gly Thr He Thr Thr Xle Ala Asp Lys Thr 
995 1000 1005 



Ala Phe Gly Phe Val Lys Gly Tyr Glu Arg Asp Lys Gin He Asn Tyr 
1010 1015 1020 

Arg Ser Ala Glu Val Asp Arg Leu Ser Asp Gly Leu Thr Gly Val Arg 
1025 1030 1035 1040 

Arg Ser Thr Gly Gin His Pro Gly Gly lie He Val He Pro Asp Asp 

1045 1050 1055 



Met Asp Val Phe Asp Phe Thr Pro He Gin Tyr Pro Ala Asp Asp Gin 

1060 1065 1070 

Thr Ala Glu Trp Gin Thr Thr His Phe Asp Phe His Ser He Asp Glu 
1075 1080 1085 

Asn Val Leu Lys Leu Asp lie Leu Gly His Asp Asp Pro Thr Met He 
1090 1095 1100 

Arg Lys Leu Gin Asp Leu Ser Gly Phe Asp Pro Gin Glu He Pro Val 
1105 1110 1115 1120 

Ser Asp Glu Asp Val Met Lys He Phe Ser Gly Pro Glu Val Leu Gly 

1125 1130 1135 



Val Thr Pro Glu Gin He Phe Ser Asn Thr Gly Thr Leu Gly Val Pro 

1140 1145 1150 



Glu Phe Gly Thr Gin Phe Val Arg Glu Met Leu Glu Gin Thr His Pro 
1155 1160 1165 



Ser Thr Phe Ala Glu Leu Leu Gin He Ser Gly Leu Ser His Gly Thr 
1170 1175 1180 



Asp Val Trp Leu Gly Asn Ala Glu Glu Leu He Arg Asn His Asn He 
1185 1190 1195 1200 
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Pro Leu Ser Glu Val lie Gly Cys Arg Asp Asp lie Met Val Tyr Leu 

1205 1210 1215 



Gin His Gin Gly Leu Glu Asp Ser Leu Ala Phe Lys lie Met Glu Phe 

1220 1225 1230 



Val Arg Lys Gly Arg Gly Leu Gin Asp Asp Trp lie Ala Thr Met Lys 
1235 1240 1245 



Glu Asn Asp Val Pro Asp Trp Tyr lie Glu Ser Cys Lys Lys lie Lys 
1250 1255 1260 



Tyr Met Phe Pro Lys Ala His Ala Ala Ala Tyr Val Leu Met Ala Leu 
1265 1270 1275 1280 



Arg Val Ala Tyr Phe Lys Val His Tyr Pro Leu Tyr Tyr Tyr Ala Ala 

1285 1290 1295 



Tyr Phe Ser lie Arg Ala Ser Asp Phe Asp Leu lie Ala Met Val Lys 

1300 1305 1310 



Gly Lys Glu Gly lie Lys Gly Ala Met Lys Glu lie Arg Asp Lys Glu 
1315 1320 1325 



Arg Glu Lys Thr Ala Thr Ala Lys Asp Lys Ala Leu Leu Thr Val Leu 
1330 1335 1340 



Glu Val Ala Asn Glu Met Val Glu Arg Gly Phe Asp Phe Lys Met Val 
1345 1350 1355 1360 



Asp lie Asn Lys Ser Gin Ala Lys Asp Phe Val lie Glu Asp Asn Gly 

1365 1370 1375 



Leu Arg Ala Pro Phe Arg Ala Val Pro Ser Leu Gly Ser Ser Ala Ala 

1380 1385 1390 



Gin Ala Val lie Asp Ala Arg Glu Asp Ser Asp Phe Leu Ser Lys Glu 
1395 1400 1405 



Asp Leu Ser Lys Arg Gly Lys Leu Ser Lys Thr Val Met Asp Tyr Leu 
1410 1415 1420 
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Asp Asn Asn His Val Leu Asp His Leu Pro Asp Glu Asn Gin Leu Ser 
1425 1430 1435 1440 



Leu Phe Asp Phe 



<210> 71 
<211> 1326 
<212> DMA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (19) . . (1326) 

<223> 

<400> 71 

aagaaaggag gactcaat atg tct atg ttt gtc gac tac acc aaa gtt aac 51 

Met Ser Met Phe Val Asp Tyr Thr Lys Val Asn 
15 10 



ctg aga gcc ggt aag ggc ggt gac gga atg gtg get ttt aga cga gaa 
Leu Arg Ala Gly Lys Gly Gly Asp Gly Met Val Ala Phe Arg Arg Glu 

15 20 25 



ttc cgc tac aac ccc cat ttt aag gca gat agt ggc caa aat ggt atg 
Phe Arg Tyr Asn Pro His Phe Lys Ala Asp Ser Gly Gin Asn Gly Met 
60 65 70 75 



ccg cct gga acc att ate egg gat gcc caa agt aag get ata ctt get 
Pro Pro Gly Thr lie He Arg Asp Ala Gin Ser Lys Ala He Leu Ala 

95 100 105 



gga ggt egg ggc aat aaa cgt ttt get acg cat aag aac cca gca ccc 
Gly Gly Arg Gly Asn Lys Arg Phe Ala Thr His Lys Asn Pro Ala Pro 
125 130 135 

tec att gcc gaa aac ggc gag ccg ggc caa gag egg gat gtc gaa ttg 
Ser He Ala Glu Asn Gly Glu Pro Gly Gin Glu Arg Asp Val Glu Leu 



99 



aag tat gag ccc aat ggt gga cca gca ggc ggc gac ggt ggc agt ggc 147 
Lys Tyr Glu Pro Asn Gly Gly Pro Ala Gly Gly Asp Gly Gly Ser Gly 
30 35 40 

ggt aac att ate ttc aag gta gat gaa ggc etc cgt acc ctg gta gac 195 
Gly Asn He He Phe Lys Val Asp Glu Gly Leu Arg Thr Leu Val Asp 
45 50 55 



243 



ccc aag ggg atg aat. ggt aag aag gca gag gac ttg att ate agt gtc 291 
Pro Lys Gly Met Asn Gly Lys Lys Ala Glu Asp Leu He He Ser Val 

80 85 90 



339 



gac tta caa gaa gaa gga caa gaa gtc ttg gca gcc caa ggt ggc egg 387 
Asp Leu Gin Glu Glu Gly Gin Glu Val Leu Ala Ala Gin Gly Gly Arg 
110 115 120 



435 



483 
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140 145 150 155 

gaa tta aaa gtc atg gcc gat gtt ggc eta gtg ggt tat cct tct gtc 
Glu Leu Lys Val Met Ala Asp Val Gly Leu Val Gly Tyr Pro Ser Val 

160 165 170 

ggg aaa teg acc ctt ttg teg gtt gtc tea ggc get aaa ccc aaa att 
Gly Lys Ser Thr Leu Leu Ser Val Val Ser Gly Ala Lys Pro Lys lie 

175 180 185 

gga gcc tat cac ttt act aca ctt gcc cct aat tta ggt gta gtg aat 
Gly Ala Tyr His Phe Thr Thr Leu Ala Pro Asn Leu Gly Val Val Asn 
190 195 200 

gca gtg gac ggc aag gaa ttt gtc ttg gcg gat att cct ggc tta att 
Ala Val Asp Gly Lys Glu Phe Val Leu Ala Asp lie Pro Gly Leu lie 
205 210 215 

gaa ggg get tea gaa ggg gtt ggt ttg ggg att gac ttc etc aag cat 
Glu Gly Ala Ser Glu Gly Val Gly Leu Gly lie Asp Phe Leu Lys His 
220 225 230 235 

att gaa aga acc cgc ate etc ctt cat gta ctt gat atg age gga atg 
lie Glu Arg Thr Arg lie Leu Leu His Val Leu Asp Met Ser Gly Met 

240 245 250 

gaa ggt cgc cat cca att gat gat ttt gac cag att aac caa gaa eta 
Glu Gly Arg His Pro lie Asp Asp Phe Asp Gin lie Asn Gin Glu Leu 

255 260 265 

aaa gac tat aat gag aaa tta ttg gac cgc aag cag gtc att gtg gcc 
Lys Asp Tyr Asn Glu Lys Leu Leu Asp Arg Lys Gin Val lie Val Ala 
270 275 280 

aat aaa atg gac ctg ccc cag tec egg gat aat tta ate gaa ttt aaa 
Asn Lys Met Asp Leu Pro Gin Ser Arg Asp Asn Leu lie Glu Phe Lys 
285 290 295 

gcc gag tta gac age egg gac ctt gac tat gaa ate ttt gaa gtg tea 
Ala Glu Leu Asp Ser Arg Asp Leu Asp Tyr Glu Xle Phe Glu Val Ser 
300 305 310 315 

get gcc acc cag get ggc att cag gac eta gtc ate cga eta gcc gac 
Ala Ala Thr Gin Ala Gly lie Gin Asp Leu Val lie Arg Leu Ala Asp 

320 325 330 

tta gtc gac caa ctg gac caa gcc cca agt tta gac cag gaa gaa act 
Leu Val Asp Gin Leu Asp Gin Ala Pro Ser Leu Asp Gin Glu Glu Thr 

335 340 345 

agt gaa gcc gac caa aga gtg gtc tac aag ttt caa get gac caa gac 
Ser Glu Ala Asp Gin Arg Val Val Tyr Lys Phe Gin Ala Asp Gin Asp 
350 355 360 

aaa ttt gac ctt gac cgc gac cct gaa ggg gta tgg ttg gtt tct ggt 
Lys Phe Asp Leu Asp Arg Asp Pro Glu Gly Val Trp Leu Val Ser Gly 
365 370 375 
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ccc aag gtt gag cgt ttg tat gcc atg acc aat ttt gac cac gag gaa 
Pro Lys Val Glu Arg Leu Tyr Ala Met Thr Asn Phe Asp His Glu Glu 
380 385 390 395 



1203 



gcc att atg egg ttt tct cgc cag eta aga ggg atg gga gta gac caa 1251 
Ala lie Met Arg Phe Ser Arg Gin Leu Arg Gly Met Gly Val Asp Gin 

400 405 410 



gcc tta aga gac aag ggg get cag tct ggt gac etc gtc caa gtt gaa 
Ala Leu Arg Asp Lys Gly Ala Gin Ser Gly Asp Leu Val Gin Val Glu 

415 420 425 



1299 



gat ttt gtc ttt gag ttc atg gat tag 1326 
Asp Phe Val Phe Glu Phe Met Asp 
430 435 



<210> 72 
<211> 435 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 72 

Met Ser Met Phe Val Asp Tyr Thr Lys Val Asn Leu Arg Ala Gly Lys 
15 10 15 



Gly Gly Asp Gly Met Val Ala Phe Arg Arg Glu Lys Tyr Glu Pro Asn 

20 25 30 

Gly Gly Pro Ala Gly Gly Asp Gly Gly Ser Gly Gly Asn lie He Phe 
35 40 45 



Lys Val Asp Glu Gly Leu Arg Thr Leu Val Asp Phe Arg Tyr Asn Pro 
50 55 60 



His Phe Lys Ala Asp Ser Gly Gin Asn Gly Met" Pro Lys Gly Met Asn 
65 70 75 80 

Gly Lys Lys Ala Glu Asp Leu He He Ser Val Pro Pro Gly Thr He 

85 90 95 



He Arg Asp Ala Gin Ser Lys Ala He Leu Ala Asp Leu Gin Glu Glu 

100 105 HO 

Gly Gin Glu Val Leu Ala Ala Gin Gly Gly Arg Gly Gly Arg Gly Asn 
115 120 125 



Lys Arg Phe Ala Thr His Lys Asn Pro Ala Pro Ser He Ala Glu Asn 
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130 135 140 

Gly Glu Pro Gly Gin Glu Arg Asp Val Glu Leu Glu Leu Lys Val Met 
145 150 155 160 

Ala Asp Val Gly Leu Val Gly Tyr Pro Ser Val Gly Lys Ser Thr Leu 

165 170 175 

Leu Ser Val Val Ser Gly Ala Lys Pro Lys lie Gly Ala Tyr His Phe 

180 185 190 

Thr Thr Leu Ala Pro Asn Leu' Gly Val Val Asn Ala Val Asp Gly Lys 
195 200 205 

Glu Phe Val Leu Ala Asp He Pro Gly Leu He Glu Gly Ala Ser Glu 
210 215 220 

Gly Val Gly Leu Gly He Asp Phe Leu Lys His He Glu Arg Thr Arg 
225 230 235 240 

He Leu Leu His Val Leu Asp Met Ser Gly Met Glu Gly Arg His Pro 

245 250 255 

He Asp Asp Phe Asp Gin He Asn Gin Glu Leu Lys Asp Tyr Asn Glu 

260 265 270 

Lys Leu Leu Asp Arg Lys Gin Val He Val Ala Asn Lys Met Asp Leu 
275 280 285 

Pro Gin Ser Arg Asp Asn Leu He Glu Phe Lys Ala Glu Leu Asp Ser 
290 295 300 

Arg Asp Leu Asp Tyr Glu He Phe Glu Val Ser Ala Ala Thr Gin Ala 
305 310 315 320 

Gly He Gin Asp Leu Val He Arg Leu Ala Asp Leu Val Asp Gin Leu 

325 330 335 

Asp Gin Ala Pro Ser Leu Asp Gin Glu Glu Thr Ser Glu Ala Asp Gin 

340 345 350 



Arg Val Val Tyr Lys Phe Gin Ala Asp Gin Asp Lys Phe Asp Leu Asp 
355 360 365 
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Arg Asp Pro Glu Gly Val Trp Leu Val Ser Gly Pro Lys Val Glu Arg 
370 375 380 



Leu Tyr Ala Met Thr Asn Phe Asp His Glu Glu Ala lie Met Arg Phe 
385 390 395 400 



Ser Arg Gin Leu Arg Gly Met Gly Val Asp Gin Ala Leu Arg Asp Lys 

405 410 415 



Gly Ala Gin Ser Gly Asp Leu Val Gin Val Glu Asp Phe Val Phe Glu 

420 425 430 



Phe Met Asp 
435 



<210> 73 
<211> 1338 
<212> DNA 

<213> Allbiococcus otitidis 

<220> 
<221> CDS 

<222> (25) . . (1338) 
<223> 

<400> 73 

aagagaaaga aagaaggtgt actg atg get aat cct tta gta gec ata ate 51 

Met Ala Asn Pro Leu Val Ala lie lie 
1 5 

ggc egg cct aat gtc ggc aag tea act att ttc aac egg att att gga 99 
Gly Arg Pro Asn Val Gly Lys Ser Thr lie Phe Asn Arg lie lie Gly 
10 15 20-25 

gac cgc tta gee att gtc cag gat gaa ccc ggg gtc ace egg gac cgt 147 
Asp Arg Leu Ala He Val Gin Asp Glu Pro Gly Val Thr Arg Asp Arg 

30 35 40 

att tat gec gat get gaa tgg ttg ggc aaa gac ttt tct gtt ata gat 195 
He Tyr Ala. Asp Ala Glu Trp Leu Gly Lys Asp Phe Ser Val He Asp 

45 50 55 



acg gga gga ate act ttt gat gat ttg ccc ttg cat gaa gaa ata aaa 
Thr Gly Gly He Thr Phe Asp Asp Leu Pro Leu His Glu Glu He Lys 
60 65 70 



243 



gtc caa get gaa att gec att gat gaa gca gat gtc ate gtc atg gta 291 
Val Gin Ala Glu He Ala He Asp Glu Ala Asp Val He Val Met Val 
75 80 85 
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acc agt gtc aaa gag ggc att aca gac ttg gat gac cag gta gcc tta 
Thr Ser Val Lys Glu Gly lie Thr Asp Leu Asp Asp Gin Val Ala Leu 
90 95 100 105 

att ttg cag cag tec aac aaa ccc gtg gtc ctt get gtt aat aaa aca 
lie Leu Gin Gin Ser Asn Lys Pro Val Val Leu Ala Val Asn Lys Thr 

110 115 120 

gat aat cct gag ctt aga aat gaa ata tat gag ttt tac ggg tta ggc 
Asp Asn Pro Glu Leu Arg Asn Glu lie Tyr Glu Phe Tyr Gly Leu Gly 

125 130 135 

ttg ggt gac ccc ctt ccg gta tec ggg tct cac ggc eta ggc ttt ggg 
Leu Gly Asp Pro Leu Pro Val Ser Gly Ser His Gly Leu Gly Phe Gly 
140 145 150 

gac etc tta gac gca gtg gtg gcc aac ttt cct aat gag gcc aat atg 
Asp Leu Leu Asp Ala Val Val Ala Asn Phe Pro Asn Glu Ala Asn Met 
155 160 165 

get tat gac caa gat acc att aag ttc tgc ttg att ggt cgt ccc aat 
Ala Tyr Asp Gin Asp Thr lie Lys Phe Cys Leu lie Gly Arg Pro Asn 
170 175 180 185 

gtt ggc aag tct age eta gtt aat get att att ggg gaa gac egg gtt 
Val Gly Lys Ser Ser Leu Val Asn Ala He lie Gly Glu Asp Arg Val 

190 195 200 

ata gtc tct gaa eta gaa ggg acc acc egg gat gca att gac act ccc 
He Val Ser Glu Leu Glu Gly Thr Thr Arg Asp Ala lie Asp Thr Pro 

205 210 215 

ttt atg acc cag gat ggc cag gac tat gtt atg ate gat act get ggg 
Phe Met Thr Gin Asp Gly Gin Asp Tyr Val Met He Asp Thr Ala Gly 
220 225 230 

ate egg cgt egg ggc aag gtc tat gaa aaa act gaa aag tat tct gtt 
He Arg Arg Arg Gly Lys Val Tyr Glu Lys Thr Glu Lys Tyr Ser Val 
235 240 245 

atg egg gca cag cga get ate gac egg tct gat gtg gtc ttg tgt gtc 
Met Arg Ala Gin Arg Ala He Asp Arg Ser Asp Val Val Leu Cys Val 
250 255 260 265 

ctg gat get gaa aca ggc att aga gac caa gat aag aag gtt ttc ggc 
Leu Asp Ala Glu Thr Gly He Arg Asp Gin Asp Lys Lys Val Phe Gly 

270 275 280 

tat get cat caa gcc ggc aag gga att att att tta gtc aat aag tgg 
Tyr Ala His Gin Ala Gly Lys Gly He He He Leu Val Asn Lys Trp 

285 290 295 

gac acg att aaa aaa gag act aac acc atg cga gac ttt gag ttg caa 
Asp Thr He Lys Lys Glu Thr Asn Thr Met Arg Asp Phe Glu Leu Gin 
300 305 310 



att cgc gac caa ttc cgc tac etc cac tat gcc cca ate ctt ttc gtc 
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lie Arg Asp Gin Phe Arg Tyr Leu His Tyr Ala Pro lie Leu Phe Val 
315 320 325 

tea gec aag acc aag cag aga ctg gaa gtc ate ccg gaa ttg gtc gac 1059 
Ser Ala Lys Thr Lys Gin Arg Leu Glu Val lie Pro Glu Leu Val Asp 
330 335 340 345 

egg gtc tat tat aac cgc aat caa egg gtc aag tec tec etc tta aat 1107 
Arg Val Tyr Tyr Asn Arg Asn Gin Arg Val Lys Ser Ser Leu Leu Asn 

350 355 360 

gat gtg ctg agt gat gca eta gee age aat cct gca cct agt aag tea 1155 
Asp Val Leu Ser Asp Ala Leu Ala Ser Asn Pro Ala Pro Ser Lys Ser 

365 370 375 

ggg aag cga etc aag gtc ttt tat gcg acc cag gta gee act aat cca 1203 
Gly Lys Arg Leu Lys Val Phe Tyr Ala Thr Gin Val Ala Thr Asn Pro 
380 385 390 

cct act ttt gtg gtt ttt gtc aat gat cct gac etc atg cac ttc tec 1251 
Pro Thr Phe Val Val Phe Val Asn Asp Pro Asp Leu Met His Phe Ser 
395 400 405 



tat gag cgc ttt tta gaa aat cga ttc cgc gaa age ttt gac ttc tat 
Tyr Glu Arg Phe Leu Glu Asn Arg Phe Arg Glu Ser Phe Asp Phe Tyr 
410 415 420 425 



1299 



ggc act ccg att cag ata ate cct aga gca agg aaa taa 133 8 

Gly Thr Pro lie Gin lie lie Pro Arg Ala Arg Lys 

430 435 



<210> 74 
<211> 437 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 74 

Met Ala Asn Pro Leu Val Ala He He Gly Arg Pro Asn Val Gly Lys 
1 5 10 15 



Ser Thr He Phe Asn Arg lie He Gly Asp Arg Leu Ala He Val Gin 

20 25 30 



Asp Glu Pro Gly Val Thr Arg Asp Arg He Tyr Ala Asp Ala Glu Trp 
35 40 45 



Leu Gly Lys Asp Phe Ser Val He Asp Thr Gly Gly He Thr Phe Asp 
50 55 60 



Asp Leu Pro Leu His Glu Glu He Lys Val Gin Ala Glu He Ala He 
65 70 75 80 
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Asp Glu Ala Asp Val He Val Met Val Thr Ser Val Lys Glu Gly He 

85 90 95 



Thr Asp Leu Asp Asp Gin Val Ala Leu He Leu Gin Gin Ser Asn Lys 

100 105 HO 



Pro Val Val Leu Ala Val Asn Lys Thr Asp Asn Pro Glu Leu Arg Asn 
115 120 125 

Glu He Tyr Glu Phe Tyr Gly Leu Gly Leu Gly Asp Pro Leu Pro Val 
130 135 140 

Ser Gly Ser His Gly Leu Gly Phe Gly Asp Leu Leu Asp Ala Val Val 
145 150 155 160 

Ala Asn Phe Pro Asn Glu Ala Asn Met Ala Tyr Asp Gin Asp Thr He 

165 170 175 

Lys Phe Cys Leu He Gly Arg Pro Asn Val Gly Lys Ser Ser Leu Val 

180 185 190 

Asn Ala He He Gly Glu Asp Arg Val He Val Ser Glu Leu Glu Gly 
195 200 205 

Thr Thr Arg Asp Ala He Asp Thr Pro Phe Met Thr Gin Asp Gly Gin 
210 215 220 

Asp Tyr Val Met He Asp Thr Ala Gly He Arg Arg Arg Gly Lys Val 
225 230 235 240 

Tyr Glu Lys Thr Glu Lys Tyr Ser Val Met Arg Ala Gin Arg Ala He 

245 250 255 

Asp Arg Ser Asp Val Val Leu Cys Val Leu Asp Ala Glu Thr Gly He 

260 265 270 



Arg Asp Gin Asp Lys Lys Val Phe Gly Tyr Ala His Gin Ala Gly Lys 
275 280 285 



Gly He He He Leu Val Asn Lys Trp Asp Thr He Lys Lys Glu Thr 
290 295 300 
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Asn Thr Met Arg Asp Phe Glu Leu Gin lie Arg Asp Gin Phe Arg Tyr 
305 310 315 320 



Leu His Tyr Ala Pro lie Leu Phe Val Ser Ala Lys Thr Lys Gin Arg 

325 330 335 



Leu Glu Val lie Pro Glu Leu Val Asp Arg Val Tyr Tyr Asn Arg Asn 

340 345 350 



Gin Arg Val Lys Ser Ser Leu Leu Asn Asp Val Leu Ser Asp Ala Leu 
355 360 365 



Ala Ser Asn Pro Ala Pro Ser Lys Ser Gly Lys Arg Leu Lys Val Phe 
370 375 380 

Tyr Ala Thr Gin Val Ala Thr Asn Pro Pro Thr Phe Val Val Phe Val 
385 390 395 400 



Asn Asp Pro Asp Leu Met His Phe Ser Tyr Glu Arg Phe Leu Glu Asn 

405 410 415 



Arg Phe Arg Glu Ser Phe Asp Phe Tyr Gly Thr Pro lie Gin lie lie 

420 425 430 



Pro Arg Ala Arg Lys 
435 



<210> 75 
<211> 3324 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (10) . . (3324) 

<223> 

<400> 75 

aataaaaga ttg aaa caa ata tgt ctt aga cga aga ggt gac aag atg act 51 

Met Lys Gin He Cys Leu Arg Arg Arg Gly Asp Lys Met Thr 

1 5 10 



ttt acc cac tta caa gtg acc agt get tac acc ttg atg get teg ace 
Phe Thr His Leu Gin Val Thr Ser Ala Tyr Thr Leu Met Ala Ser Thr 
15 20 25 30 

ate caa ttg ccc etc ctg atg gac cgc ctg aag gag ctt ggc atg gag 
He Gin Leu Pro Leu Leu Met Asp Arg Leu Lys Glu Leu Gly Met Glu 



99 



147 
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35 40 45 

get gtt gec ttg acc gac cac aat gtt atg cat gga gcg gtc gaa ttt 195 
Ala Val Ala Leu Thr Asp His Asn Val Met His Gly Ala Val Glu Phe 

50 55 60 



tac caa gaa gec aaa aag cat ggc att aaa ccc att atg gga eta egg 
Tyr Gin Glu Ala Lys Lys His Gly lie Lys Pro lie Met Gly Leu Arg 
65 70 75 



age cac ate aaa gec aac cag aaa att caa ttt gac acc cag get egg 
Ser His He Lys Ala Asn Gin Lys He Gin Phe Asp Thr Gin Ala Arg 

210 215 220 



tea gta gac tgg tec ctg gac etc ggt cag get aaa ttg cct gca ttt 
Ser Val Asp Trp Ser Leu Asp Leu Gly Gin Ala Lys Leu Pro Ala Phe 
255 260 265 270 



243 



get gac eta gac gaa gga ata acc gtc acc etc ctg get aaa aac aag 291 
Ala Asp Leu Asp Glu Gly He Thr Val Thr Leu Leu Ala Lys Asn Lys 
80 85 90 

get ggc tac cag get etc tta gee tta teg act gac ctt caa gtt aac 339 
Ala Gly Tyr Gin Ala Leu Leu Ala Leu Ser Thr Asp Leu Gin Val Asn 
95 100 105 110 

aag cag get att aca ctt gac caa gtc cgt tct gtg gee cag gac etc 387 
Lys Gin Ala He Thr Leu Asp Gin Val Arg Ser Val Ala Gin Asp Leu 

115 120 125 

tat aca ata ttc cca age tct gac cca aaa gtg aaa gca gac etc tta 435 
Tyr Thr He Phe Pro Ser Ser Asp Pro Lys Val Lys Ala Asp Leu Leu 

130 135 140 

gat aag cag gca age aat ttg acc gcg atg act cag aac ctg ccc cat 483 
Asp Lys Gin Ala Ser Asn Leu Thr Ala Met Thr Gin Asn Leu Pro His 
145 150 155 

tea tat ttg ggt ctg gtg cca gac caa gat caa aaa att tac cag tta 531 
Ser Tyr Leu Gly Leu Val Pro Asp Gin Asp Gin Lys He Tyr Gin Leu 
160 165 170 

gee egg acc ttg tea gat tct gga ggt ttg aaa gtc tta gee tta tct 579 
Ala Arg Thr Leu Ser Asp Ser Gly Gly Leu Lys Val Leu Ala Leu Ser 
175 180 185 190 

gac gtc cgt tgc ttg gaa gaa age caa gtc tec act ttg gaa ate tta 627 
Asp Val Arg Cys Leu Glu Glu Ser Gin Val Ser Thr Leu Glu He Leu 

195 200 205 



675 



gaa aat tat gec ctg cgc agt ccc caa gaa atg gag tct ttt ttt aac 723 

Glu Asn Tyr Ala Leu Arg Ser Pro Gin Glu Met Glu Ser Phe Phe Asn 
225 230 235 

cag gtg ggt tta ggt cag gee ctt aaa aat act aaa gat gta gec cag 771 

Gin Val Gly Leu Gly Gin Ala Leu Lys Asn Thr Lys Asp Val Ala Gin 

240 245 250 



819 
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gac ctg ccg gaa ggg gag acc aag gac tec tac ctt ggc aag ctt gec 
Asp Leu Pro Glu Gly Glu Thr Lys Asp Ser Tyr Leu Gly Lys Leu Ala 

275 280 285 



867 



caa aaa gga etc caa gaa egg gtt cca ggc tac ggc caa gac tac caa 915 
Gin Lys Gly Leu Gin Glu Arg Val Pro Gly Tyr Gly Gin Asp Tyr Gin 

290 295 300 



gac cgt eta gac aag gaa eta gcg gtt att tct tec atg ggc ttt teg 
Asp Arg Leu Asp Lys Glu Leu Ala Val lie Ser Ser Met Gly Phe Ser 
305 310 315 



aaa att gag act ggt ttt ggc egg ggg tea get gec get tct ttg gta 
Lys lie Glu Thr Gly Phe Gly Arg Gly Ser Ala Ala Ala Ser Leu Val 
335 340 345 350 

tct tat gec etc tac att acg ggg gta gat ccc ate cat tat gac etc 
Ser Tyr Ala Leu Tyr lie Thr Gly Val Asp Pro lie His Tyr Asp Leu 

355 360 365 

etc ttt gaa cgt ttt ttg aac aag gac cgc ttt acc atg cct gat att 
Leu Phe Glu Arg Phe Leu Asn Lys Asp Arg Phe Thr Met Pro Asp He 

370 375 380 

gac eta gac ttc cca gac aac aag cgc cag gtc ate ttg gac tat gtc 
Asp Leu Asp Phe Pro Asp Asn Lys Arg Gin Val He Leu Asp Tyr Val 
385 390 395 



acc ttt gcg get aag tec tec ate agg gaa att atg egg acc ttg ggt 
Thr Phe Ala Ala Lys Ser Ser He Arg Glu He Met Arg Thr Leu Gly 
415 420 425 430 

tac aag aat gaa gac atg aag acc tgg tec cag gee ata cca gat acc 
Tyr Lys Asn Glu Asp Met Lys Thr Trp Ser Gin Ala He Pro Asp Thr 

435 440 445 



963 



gac tac ttc ctg att gtt tgg gac ctg atg caa ttt gec cgc cag gaa 1011 
Asp Tyr Phe Leu He Val Trp Asp Leu Met Gin Phe Ala Arg Gin Glu 
320 325 330 



1059 



1107 



1155 



1203 



tac egg aag tat ggt cct gac cat gtg gee caa att ttg acc ttt ggg 1251 
Tyr Arg Lys Tyr Gly Pro Asp His Val Ala Gin He Leu Thr Phe Gly 
400 405 410 



1299 



1347 



gtc aac ate age ttg tea aag gee tat gac gag teg aaa gac ctt caa 13 95 

Val Asn He Ser Leu Ser Lys Ala Tyr Asp Glu Ser Lys Asp Leu Gin 

450 455 460 

aaa ctg gtc cag caa age cat gaa aat gag egg ate ttt gec atg gec 1443 
Lys Leu Val Gin Gin Ser His Glu Asn Glu Arg He Phe Ala Met Ala 
465 470 475 

cag gat ate gaa ggc ctg cca agg aac tat tea acc cat gcg gee ggt 1491 
Gin Asp He Glu Gly Leu Pro Arg Asn Tyr Ser Thr His Ala Ala Gly 
480 485 490 
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gtc gtc atg tea gac cag ccc eta ate cat tec ctt ccc eta caa gat 1539 
Val Val Met Ser Asp Gin Pro Leu lie His Ser Leu Pro Leu Gin Asp 
495 500 505 510 

ggc aac gga aag gtc ccc aac acc caa ttt ace atg gag gat gtt gaa 1587 
Gly Asn Gly Lys Val Pro Asn Thr Gin Phe Thr Met Glu Asp Val Glu 

515 520 525 

gcg gtc ggc tta etc aag atg gac ttt ttg agt tta aaa aat tta acc 1635 
Ala Val Gly Leu Leu Lys Met Asp Phe Leu Ser Leu Lys Asn Leu Thr 

530 535 540 

ate eta gca gac tgc ttg aac ttt age cag tat gaa ggg cag gga ggg 1683 
lie Leu Ala Asp Cys Leu Asn Phe Ser Gin Tyr Glu Gly Gin Gly Gly 
545 550 555 

ggt ata agt aaa caa gat ata cca ate gac gac cct aag acc ctg gat 1731 
Gly lie Ser Lys Gin Asp lie Pro He Asp Asp Pro Lys Thr Leu Asp 
560 565 570 

ctt ttt gee egg gga gac aca aat ggg gtc ttc caa ttt gaa aaa gag 1779 
Leu Phe Ala Arg Gly Asp Thr Asn Gly Val Phe Gin Phe Glu Lys Glu 
575 580 585 590 

gga ate aaa aaa gtc etc cgc cag ctt caa ccc act tct ttt gaa gat 1827 
Gly He Lys Lys Val Leu Arg Gin Leu Gin Pro Thr Ser Phe Glu Asp 

595 600 605 

ate gtc gee acc aac gec etc tac cgc ccc ggt ccc atg ggg caa att 1875 
He Val Ala Thr Asn Ala Leu Tyr Arg Pro Gly Pro Met Gly Gin He 

610 615 620 

gag aat tat att aac cgt aaa cat ggt caa gaa aaa att ate tac ccc 1923 
Glu Asn Tyr He Asn Arg Lys His Gly Gin Glu Lys He He Tyr Pro 
625 630 635 

cat gaa gac tta aag gac ate ctt gaa gtc act tat ggc att att gtc 1971 
His Glu Asp Leu Lys Asp He Leu Glu Val Thr Tyr Gly He He Val 
640 645 650 

tac cag gaa caa gtc atg cag gta get acc caa eta get ggc tat agt 2019 
Tyr Gin Glu Gin Val Met Gin Val Ala Thr Gin Leu Ala Gly Tyr Ser 
655 660 665 670 

ttg teg gaa get gac caa ttg egg egg act atg tec aaa aaa ate cag 2067 
Leu Ser Glu Ala Asp Gin Leu Arg Arg Thr Met Ser Lys Lys He Gin 

675 680 685 

tea gaa atg gac cag gga egg gaa aaa ttt ata aga gga gee ttg gac 2115 
Ser Glu Met Asp Gin Gly Arg Glu Lys Phe He Arg Gly Ala Leu Asp 

690 695 700 

aag ggc tac agt gag tea gta gee cga gag gtt tat aac tat att gca 2163 
Lys Gly Tyr Ser Glu Ser Val Ala Arg Glu Val Tyr Asn Tyr He Ala 
705 710 715 



aag ttt get aac tac ggc ttt aac cgt gec cat get gtt gec tac tec 



2211 
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Lys Phe Ala Asn Tyr Gly Phe Asn Arg Ala His Ala Val Ala Tyr Ser 

720 725 730 

atg ctt gcc tac cat atg gcc tac ttt aag gtc cac cag cct aaa tct 2259 

Met Leu Ala Tyr His Met Ala Tyr Phe Lys Val His Gin Pro Lys Ser 

735 740 745 750 



ttt ttt gcg get gtg atg aag gca gac tgg ggt aac aag get aaa att 
Phe Phe Ala Ala Val Met Lys Ala Asp Trp Gly Asn Lys Ala Lys lie 

755 760 765 



cca gac ate aac caa age ctt gga tct ttt acg gtt egg cag aat ggc 
Pro Asp lie Asn Gin Ser Leu Gly Ser Phe Thx Val Arg Gin Asn Gly 
785 790 795 



2307 



tac aag tat gcc cat gaa gtc egg get aga aaa att aaa eta eta aaa 2355 
Tyr Lys Tyr Ala His Glu Val Arg Ala Arg Lys lie Lys Leu Leu Lys 

770 775 780 



2403 



att caa gtg ggg ctt aag atg gtc aag ggg gtg get age ccc ttt gtc 2451 
He Gin Val Gly Leu Lys Met Val Lys Gly Val Ala Ser Pro Phe Val 
800 805 810 

aac cac ate ctt gaa att egg aaa gaa aag gga get ttt acc age ctg 2499 
Asn His He Leu Glu He Arg Lys Glu Lys Gly Ala Phe Thr Ser Leu 
815 820 825 830 

cgt gac ttt tgt gaa aaa att gac age caa ttc tta agt caa gac ccc 2547 
Arg Asp Phe Cys Glu Lys He Asp Ser Gin Phe Leu Ser Gin Asp Pro 

835 840 845 

att gaa gca ttg att ttg gtg ggg gcc ttt gac caa atg ggc cct aat 2595 
He Glu Ala Leu He Leu Val Gly Ala Phe Asp Gin Met Gly Pro Asn 

850 855 860 

egg egg acc atg tta gcg ggc ttg gaa gca acg att gaa ttc gtg gcc 2643 
Arg Arg Thr Met Leu Ala Gly Leu Glu Ala Thr He Glu Phe Val Ala 
865 870 875 

aaa agt teg ggc aat ate acc ctt ttt gac act etc aag ccc cgc caa 2691 
Lys Ser Ser Gly Asn He Thr Leu Phe Asp Thr Leu Lys Pro Arg Gin 
880 885 890 

gaa gac ctg gaa gag ttt age cca aag gac etc att caa tat gaa gaa 2739 
Glu Asp Leu Glu Glu Phe Ser Pro Lys Asp Leu He Gin Tyr Glu Glu 
895 900 905 910 

gaa tta acc ggt ttt tac ttc tec age cac ccc ttg age egg tat gac 2787 
Glu Leu Thr Gly Phe Tyr Phe Ser Ser His Pro Leu Ser Arg Tyr Asp 

915 920 925 

tec ctg cga cag gac tta aaa acg tec ttt ata get gat tta gaa gag 2835 
Ser Leu Arg Gin Asp Leu Lys Thr Ser Phe He Ala Asp Leu Glu Glu 

930 935 940 



ggc caa tct tgc caa gtt tta ggt cag ctg gtt caa gtc egg aaa act 
Gly Gin Ser Cys Gin Val Leu Gly Gin. Leu Val Gin Val Arg Lys Thr 



2883 
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945 950 955 

cag act aga aac caa caa ccc atg gcc ttt gtt age ctg get gac caa 
Gin Thr Arg Asn Gin Gin Pro Met Ala Phe Val Ser Leu Ala Asp Gin 
960 965 970 

aca gga caa att age ctg gtg gtc ttt ccg aat gta tac cgc gaa tgc 
Thr Gly Gin lie Ser Leu Val Val Phe Pro Asn Val Tyr Arg Glu Cys 
975 980 985 990 

eta cct tac etc aaa gaa gga gtg gtc ctg gtc gtc tea ggc aag gta 
Leu Pro Tyr Leu Lys Glu Gly Val Val Leu Val Val Ser Gly Lys Val 

995 1000 1005 

gaa gtt agg aag gga gaa ate cag eta aaa gtc cag acc atg aaa gag 
Glu Val Arg Lys Gly Glu He Gin Leu Lys Val Gin Thr Met Lys Glu 

1010 1015 1020 



gcc cga cat ccc ggc cag aag cga gtg att gtt tac gac cag gcc age 
Ala Arg His Pro Gly Gin Lys Arg Val He Val Tyr Asp Gin Ala Ser 
1055 1060 1065 1070 



tta aaa taa 
Leu Lys.- 



2931 



2979 



3027 



3075 



gcc age cag gtc caa aaa gag act aag cag ctt tac ctg aaa ttt get 3123 
Ala Ser Gin Val Gin Lys Glu Thr Lys Gin Leu Tyr Leu Lys Phe Ala 
1025 1030 1035 

gac ttg aac caa gat aaa gaa agt ttt cgt caa gtg caa aag ate ttg 3171 
Asp Leu Asn Gin Asp Lys Glu Ser Phe Arg Gin Val Gin Lys He Leu 
1040 1045 1050 



3219 



cag caa gca etc cag etc aaa gca aaa ttt aat ttc gac gga egg acg 3267 
Gin Gin Ala Leu Gin Leu Lys Ala Lys Phe Asn Phe Asp Gly Arg Thr 

1075 1080 1085 

gat acc eta aac cag etc cag gac etc eta ggc cag gat tct tgt ate 3315 
Asp Thr Leu Asn Gin Leu Gin Asp Leu Leu Gly Gin Asp Ser Cys He 

1090 1095 1100 



3324 



<210> 76 
<211> 1104 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 76 

Met Lys Gin He Cys Leu Arg Arg Arg Gly Asp Lys Met Thr Phe Thr 
15 10 15 



His Leu Gin Val Thr Ser Ala Tyr Thr Leu Met Ala Ser Thr He Gin 

20 25 30 



Leu Pro Leu Leu Met Asp Arg Leu Lys Glu Leu Gly Met Glu Ala Val 
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Ala Leu Thr Asp His Asn Val Met His Gly Ala Val Glu Phe Tyr Gin 
50 55 60 

Glu Ala Lys Lys His Gly He Lys Pro He Met Gly Leu Arg Ala Asp 
65 70 75 80 

Leu Asp Glu Gly He Thr Val Thr Leu Leu Ala Lys Asn Lys Ala Gly 

85 90 95 

Tyr Gin Ala Leu Leu Ala Leu Ser Thr Asp Leu Gin Val Asn Lys Gin 

100 105 HO 

Ala He Thr Leu Asp Gin Val Arg Ser Val Ala Gin Asp Leu Tyr Thr 
115 120 125 

He Phe Pro Ser Ser Asp Pro Lys Val Lys Ala Asp Leu Leu Asp Lys 
130 135 140 

Gin Ala Ser Asn Leu Thr Ala Met Thr Gin Asn Leu Pro His Ser Tyr 
145 150 155 160 

Leu Gly Leu Val Pro Asp Gin Asp Gin Lys He Tyr Gin Leu Ala Arg 

165 170 1*75 

Thr Leu Ser Asp Ser Gly Gly Leu Lys Val Leu Ala Leu Ser Asp Val 

180 185 190 

Arg Cys Leu Glu Glu Ser Gin Val Ser Thr Leu Glu He Leu Ser His 
195 200 205 

He Lys Ala Asn Gin Lys He Gin Phe Asp Thr Gin Ala Arg Glu Asn 
210 215 220 

Tyr Ala Leu Arg Ser Pro Gin Glu Met Glu Ser Phe Phe Asn Gin Val 
225 230 235 240 

Gly Leu Gly Gin Ala Leu Lys Asn Thr Lys Asp Val Ala Gin Ser Val 

245 250 255 



Asp Trp Ser Leu Asp Leu Gly Gin Ala Lys Leu Pro Ala Phe Asp Leu 

260 265 270 
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Pro Glu Gly Glu Thr Lys Asp Ser Tyr Leu Gly Lys Leu Ala Gin Lys 
275 280 285 



Gly Leu Gin Glu Arg Val Pro Gly Tyr Gly Gin Asp Tyr Gin Asp Arg 
290 295 300 



Leu Asp Lys Glu Leu Ala Val lie Ser Ser Met Gly Phe Ser Asp Tyr 
305 310 315 320 



Phe Leu He Val Trp Asp Leu Met Gin Phe Ala Arg Gin Glu Lys He 

325 330 335 



Glu Thr Gly Phe Gly Arg Gly Ser Ala Ala Ala Ser Leu Val Ser Tyr 

340 345 350 



Ala Leu Tyr He Thr Gly Val Asp Pro He His Tyr Asp Leu Leu Phe 
355 360 365 



Glu Arg Phe Leu Asn Lys Asp Arg Phe Thr Met Pro Asp He Asp Leu 
370 375 380 



Asp Phe Pro Asp Asn Lys Arg Gin Val He Leu Asp Tyr Val Tyr Arg 
385 390 395 400 



Lys Tyr Gly Pro Asp His Val Ala Gin He Leu Thr Phe Gly Thr Phe 

405 410 415 



Ala Ala Lys Ser Ser He Arg Glu He Met Arg Thr Leu Gly Tyr Lys 

420 425 430 



Asn Glu Asp Met Lys Thr Trp Ser Gin Ala He Pro Asp Thr Val Asn 
435 440 445 



He Ser Leu Ser Lys Ala Tyr Asp Glu Ser Lys Asp Leu Gin Lys Leu 
450 455 460 



Val Gin Gin Ser His Glu Asn Glu Arg He Phe Ala Met Ala Gin Asp 
465 470 475 480 



He Glu Gly Leu Pro Arg Asn Tyr Ser Thr His Ala Ala Gly Val Val 

485 490 495 
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Met Ser Asp Gin Pro Leu lie His Ser Leu Pro Leu Gin Asp Gly Asn 

500 505 510 

Gly Lys Val Pro Asn Thr Gin Phe Thr Met Glu Asp Val Glu Ala Val 
515 520 525 

Gly Leu Leu Lys Met Asp Phe Leu Ser Leu Lys Asn Leu Thr He Leu 
530 535 540 

Ala Asp Cys Leu Asn Phe Ser Gin Tyr Glu Gly Gin Gly Gly Gly He 
545 550 555 560 

Ser Lys Gin Asp He Pro He Asp Asp Pro Lys Thr Leu Asp Leu Phe 

565 570 575 



Ala Arg Gly Asp Thr Asn Gly Val Phe Gin Phe Glu Lys Glu Gly He 

580 585 590 

Lys Lys Val Leu Arg Gin Leu Gin Pro Thr Ser Phe Glu Asp He Val 
595 600 605 

Ala Thr Asn Ala Leu Tyr Arg Pro Gly Pro Met Gly Gin He Glu Asn 
610 615 620 

Tyr He Asn Arg Lys His Gly Gin Glu Lys He He Tyr Pro His Glu 
625 630 635 640 

Asp Leu Lys Asp He Leu Glu Val Thr Tyr Gly He He Val Tyr Gin 

645 650 655 



Glu Gin Val Met Gin Val Ala Thr Gin Leu Ala Gly Tyr Ser Leu Ser 

660 665 670 

Glu Ala Asp Gin Leu Arg Arg Thr Met Ser Lys Lys He Gin Ser Glu 
675 680 685 



Met Asp Gin Gly Arg Glu Lys Phe He Arg Gly Ala Leu Asp Lys Gly 
690 695 700 

Tyr Ser Glu Ser Val Ala Arg Glu Val Tyr Asn Tyr He Ala Lys Phe 
705 710 715 720 
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Ala Asn Tyr Gly Phe Asn Arg Ala His Ala Val Ala Tyr Ser Met Leu 

725 730 735 



Ala Tyr His Met Ala Tyr Phe Lys Val His Gin Pro Lys Ser Phe Phe 

740 745 750 



Ala Ala Val Met Lys Ala Asp Trp Gly Asn Lys Ala Lys lie Tyr Lys 
755 760 765 



Tyr Ala His Glu Val Arg Ala Arg Lys lie Lys Leu Leu Lys Pro Asp 
770 775 780 



lie Asn Gin Ser Leu Gly Ser Phe Thr Val Arg Gin Asn Gly lie Gin 
785 790 795 800 



Val Gly Leu Lys Met Val Lys Gly Val Ala Ser Pro Phe Val Asn His 

805 810 815 



He Leu Glu He Arg Lys Glu Lys Gly Ala Phe Thr Ser Leu Arg Asp 

820 825 830 



Phe Cys Glu Lys He Asp Ser Gin Phe Leu Ser Gin Asp Pro He Glu 
835 840 845 



Ala Leu He Leu Val Gly Ala Phe Asp Gin Met Gly Pro Asn Arg Arg 
850 855 860 



Thr Met Leu Ala Gly Leu Glu Ala Thr He Glu Phe Val Ala Lys Ser 
865 870 875 880 



Ser Gly Asn He Thr Leu Phe Asp Thr Leu Lys Pro Arg Gin Glu Asp 

885 890 895 



Leu Glu Glu Phe Ser Pro Lys Asp Leu He Gin Tyr Glu Glu Glu Leu 

900 905 910 



Thr Gly Phe Tyr Phe Ser Ser His Pro Leu Ser Arg Tyr Asp Ser Leu 
915 920 925 



Arg Gin Asp Leu Lys Thr Ser Phe He Ala Asp Leu Glu Glu Gly Gin 
930 935 940 



Ser Cys Gin Val Leu Gly Gin Leu Val Gin Val Arg Lys Thr Gin Thr 
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945 950 955 960 



Arg Asn Gin Gin Pro Met Ala Phe Val Ser Leu Ala Asp Gin Thr Gly 

965 970 975 



Gin lie Ser Leu Val Val Phe Pro Asn Val Tyr Arg Glu Cys Leu Pro 

980 985 990 



Tyr Leu Lys Glu Gly Val Val Leu Val Val Ser Gly Lys Val Glu Val 
995 1000 1005 



Arg Lys Gly Glu lie Gin Leu Lys Val Gin Thr Met Lys Glu Ala Ser 
1010 1015 1020 



Gin Val Gin Lys Glu Thr Lys Gin Leu Tyr Leu Lys Phe Ala Asp Leu 
1025 1030 1035 1040 



Asn Gin Asp Lys Glu Ser Phe Arg Gin Val Gin Lys lie Leu Ala Arg 

1045 1050 1055 



His Pro Gly Gin Lys Arg Val lie Val Tyr Asp Gin Ala Ser Gin Gin 

1060 1065 1070 



Ala Leu Gin Leu Lys Ala Lys Phe Asn Phe Asp Gly Arg Thr Asp Thr 
1075 1080 1085 



Leu Asn Gin Leu Gin Asp Leu Leu Gly Gin Asp Ser Cys lie Leu Lys 
1090 1095 1100 



<210> 77 
<211> .1212 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (7) . . (1212) 

<223> 

<400> 77 

acaaag atg ctg aaa aat aaa aag ata gcc tta tat gtt act ggt ggt 

Met Leu Lys Asn Lys Lys lie Ala Leu Tyr Val Thr Gly Gly 
15 10 
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ata gca gta tac aaa tea ctt tac tta ctt agg gaa ate ate aaa caa 96 
He Ala Val Tyr Lys Ser Leu Tyr Leu Leu Arg Glu He He Lys Gin 
15 20 25 30 



ggc ggg gag gtc egg gtt gee atg act caa gca get tgt caa ttt gtt 
Gly Gly Glu Val Arg Val Ala Met Thr Gin Ala Ala Cys Gin Phe Val 

35 40 45 



ggc aag ctg gec aat ggg att ggg gac gat ttt gtt tea aca gec ttg 
Gly Lys Leu Ala Asn Gly He Gly Asp Asp Phe Val Ser Thr Ala Leu 
95 100 105 110 



144 



aac ccc tta tct ttt cag gtt tta age caa aaa aag gtt cag att gac 192 

Asn Pro Leu Ser Phe Gin Val Leu Ser Gin Lys Lys Val Gin He Asp 

50 55 60 

act ttt gaa gaa ggt cag ccc gaa teg gtc agt cac att gat ttg acg 240 

Thr Phe Glu Glu Gly Gin Pro Glu Ser Val Ser His He Asp Leu Thr 

65 70 75 

gat tgg gec gac tac "tec ate gtg get ccg gca act gec aat ate ate 288 

Asp Trp Ala Asp Tyr Ser He Val Ala Pro Ala Thr Ala Asn He He 
80 85 90 



336 



ttg gca acg gac cac ccc att ttt tta gtc cca gee atg aac acc aag 384 
Leu Ala Thr Asp His Pro He Phe Leu Val Pro Ala Met Asn Thr Lys 

115 120 125 

atg tat gaa aat ccc get ctt aag aaa aac aag gec ttc ctt att gaa 432 
Met Tyr Glu Asn Pro Ala Leu Lys Lys Asn Lys Ala • Phe Leu He Glu 

130 135 140 

cag ggc cat tac tgg atg gag ccg gat att gga ttt tta gca gag ggc 480 
Gin Gly His Tyr Trp Met Glu Pro Asp He Gly Phe Leu Ala Glu Gly 
145 150 155 

tac gaa ggc ttg ggt cgt ttt cca gac eta gac egg att atg gcg gaa 528 
Tyr Glu Gly Leu Gly Arg Phe Pro Asp Leu Asp Arg He Met Ala Glu 
160 165 170 

ttt aac cat ttt att att get agg aat cca ggt ate eta tea gga aaa 57 6 

Phe Asn His Phe He He Ala Arg Asn Pro Gly He Leu Ser Gly Lys 
175 180 185 190 

aaa gtc etc gtc aca gca ggt ggg acg gtg gag egg att gat ccc gtc 624 
Lys Val Leu Val Thr Ala Gly Gly Thr Val Glu Arg He Asp Pro Val 

195 200 205 

egg tat att tec aat gat tct tct ggt aag atg ggc cac caa ctt get 672 
Arg Tyr He Ser Asn Asp Ser Ser Gly Lys Met Gly His Gin Leu Ala 

210 215 220 

caa gcg gec tat gaa get ggg gee cag gtt age ttg gta aca gee agt 720 
Gin Ala Ala Tyr Glu Ala Gly Ala Gin Val Ser Leu Val Thr Ala Ser 
225 230 235 



gac ttg ccg acc agt ccc ttt att gac cgc ttt cag gtg gag tec acc 



768 
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Asp Leu Pro Thr Ser Pro Phe lie Asp Arg Phe Gin Val Glu Ser Thr 
240 245 250 

tta gac ttg tac caa aca gtt agt gac etc tat gac cac cat gac att 
L eu Asp Leu Tyr Gin Thr Val Ser Asp Leu Tyr Asp His His Asp lie 
255 260 265 270 



816 



etc atg atg gec gca gcg gtg tct gac tac egg cca gtc aac egg tea 8 64 

Leu Met Met Ala Ala Ala Val Ser Asp Tyr Arg Pro Val Asn Arg Ser 

275 280 285 

gac aaa aag atg aaa aag caa gat aat tta acc att gaa ctg gaa aaa 912 
Asp Lys Lys Met Lys Lys Gin Asp Asn Leu Thr lie Glu Leu Glu Lys 

290 295 300 

aat cct gat att ttg gec gaa atg ggc egg egg aaa gac caa caa ate 960 
Asn Pro Asp He Leu Ala Glu Met Gly Arg Arg Lys Asp Gin Gin He 
305 310 315 

aat gtc ggc ttt gca gca gaa acc cat aac ctt gaa gaa tat gec caa 1008 
Asn Val Gly Phe Ala Ala Glu Thr His Asn Leu Glu Glu Tyr Ala Gin 
320 325 330 

aaa aaa tta gee tec aaa caa get gac ttg ate gta gec aat gaa gtg 1056 
Lys Lys Leu Ala Ser Lys Gin Ala Asp Leu He Val Ala Asn Glu Val 
335 340 345 350 

ggc egg gga gac egg ggc ttt aat gcg gat gaa aat gcg gec ctt gtt 1104 
Gly Arg Gly Asp Arg Gly Phe Asn Ala Asp Glu Asn Ala Ala Leu Val 

355 360 365 

ttt tec agt gac caa gat ccg ctt gag ctt ccc ctt cag tct aaa aaa 1152 
Phe Ser Ser Asp Gin Asp Pro Leu Glu Leu Pro Leu Gin Ser Lys Lys 

370 375 380 

gat atg gca aaa aag att att gaa gtg gtg gec agt aaa ttg cct get 1200 
Asp Met Ala Lys Lys He He Glu Val Val Ala Ser Lys Leu Pro Ala 
385 390 395 

tct ccc aaa taa 1212 
Ser Pro Lys 
400 



<210> 78 
<211> 401 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 78 

Met Leu Lys Asn Lys Lys He Ala Leu Tyr Val Thr Gly Gly He Ala 
15 10 15 



Val Tyr Lys Ser Leu Tyr Leu Leu Arg Glu He He Lys Gin Gly Gly 

20 25 30 
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Glu Val Arg Val Ala Met Thr Gin Ala Ala Cys Gin Phe Val Asn Pro 
35 40 45 



Leu Ser Phe Gin Val Leu Ser Gin Lys Lys Val Gin lie Asp Thr Phe 
50 55 60 



Glu Glu Gly Gin Pro Glu Ser Val Ser His He Asp Leu Thr Asp Trp 
65 70 75 80 



Ala Asp Tyr Ser He Val Ala Pro Ala Thr Ala Asn He He Gly Lys 

85 90 95 



Leu Ala Asn Gly He Gly Asp Asp Phe Val Ser Thr Ala Leu Leu Ala 

100 105 110 



Thr Asp His Pro He Phe Leu Val Pro Ala Met Asn Thr Lys Met Tyr 
115 120 125 



Glu Asn Pro Ala Leu Lys Lys Asn Lys Ala Phe Leu He Glu Gin Gly 
130 135 140 



His Tyr Trp Met Glu Pro Asp He Gly Phe Leu Ala Glu Gly Tyr Glu 
145 150 155 160 



Gly Leu Gly Arg Phe Pro Asp Leu Asp Arg He Met Ala Glu Phe Asn 

165 170 175 



His Phe He He Ala Arg Asn Pro Gly He Leu Ser Gly Lys Lys Val 

180 185 190 



Leu Val Thr Ala Gly Gly Thr Val Glu Arg He Asp Pro Val Arg Tyr 
195 200 205 



lie Ser Asn Asp Ser Ser Gly Lys Met Gly His Gin Leu Ala Gin Ala 
210 215 220 



Ala Tyr Glu Ala Gly Ala Gin Val Ser Leu Val Thr Ala Ser Asp Leu 
225 230 235 240 



Pro Thr Ser Pro Phe He Asp Arg Phe Gin Val Glu Ser Thr Leu Asp 

245 250 255 
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Leu Tyr Gin Thr Val Ser Asp Leu Tyr Asp His His Asp lie Leu Met 

260 265 270 



Met Ala Ala Ala Val Ser Asp Tyr Arg Pro Val Asn Arg Ser Asp Lys 
275 280 285 

Lys Met Lys Lys Gin Asp Asn Leu Thr He Glu Leu Glu Lys Asn Pro 
290 295 300 

Asd He Leu Ala Glu Met Gly Arg Arg Lys Asp Gin Gin He Asn Val 
305 310 315 320 

Gly Phe Ala Ala Glu Thr His Asn Leu Glu Glu Tyr Ala Gin Lys Lys 

325 330 335 

Leu Ala Ser Lys Gin Ala Asp Leu He Val Ala Asn Glu Val Gly Arg 

340 345 350 

Gly Asp Arg Gly Phe Asn Ala Asp Glu Asn Ala Ala Leu Val Phe Ser 
355 360 365 

Ser Asp Gin Asp Pro Leu Glu Leu , Pro Leu Gin Ser Lys Lys Asp Met 
370 375 380 

Ala Lys Lys He He Glu Val" Val Ala Ser Lys Leu Pro Ala Ser Pro 
385 390 395 400 



Lys 



<210> 79 
<211> 1053 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (22) . . (1053) 

<223> 

<400> 79 

aagaagaagg gaggaagact g atg aaa att gaa gac caa etc aaa aaa att 

Met Lys He Glu Asp Gin Leu Lys Lys He 
1 5 10 

aaa gac caa gac ttg tct ccc etc tac ctg gtc cag gga gat gac cag 
Lys Asp Gin Asp Leu Ser Pro Leu Tyr Leu Val Gin Gly Asp Asp Gin 

15 20 25 



51 



99 
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tac ttg tta gac cag gtt aaa aaa agt ttg age cag gec ctt ttg gac 147 
Tyr Leu Leu Asp Gin Val Lys Lys Ser Leu Ser Gin Ala Leu Leu Asp 

30 35 40 

cag gat gaa get tct atg aat ttt ggt caa ttt aat atg atg get gat 195 
Gin Asp Glu Ala Ser Met Asn Phe Gly Gin Phe Asn Met Met Ala Asp 
45 50 55 



age eta gac atg gec ttg tct gat gcg gaa tec tat ccc ttt ttt ggg 
Ser Leu Asp Met Ala Leu Ser Asp Ala Glu Ser Tyr Pro Phe Phe Gly 
60 65 70 



aag egg aaa aca gat ctg gac cat gac ttg gat cgc ttg ctg get tac 
Lys Arg Lys Thr Asp Leu Asp His Asp Leu Asp Arg Leu Leu Ala Tyr 

95 100 105 

etc caa aac cca gec gac ttt act gtt etc gtc ttc ttt gee ccc tat 
Leu Gin Asn Pro Ala Asp Phe Thr Val Leu Val Phe Phe Ala Pro Tyr 

110 115 120 

gag aaa ctg gac aag egg aag aag gtc ace aaa gee eta ttg cag gaa 
Glu Lys Leu Asp Lys Arg Lys Lys Val Thr Lys Ala Leu Leu Gin Glu 
125 130 135 

get gag att ata gat gee agt tec cca gac caa aga gat eta aaa gat 
Ala Glu He He Asp Ala Ser Ser Pro Asp Gin Arg Asp Leu Lys Asp 
140 145 150 



get tta aag gee ctg gtt gaa aaa ace aat gee aac tta agt egg gtc 
Ala Leu Lys Ala Leu Val Glu Lys Thr Asn Ala Asn Leu Ser Arg Val 

175 180 185 

atg caa gag ttg gac aag tta ttc ttg tac cat tta gat gac aaa ate 
Met Gin Glu Leu Asp Lys Leu Phe Leu Tyr His Leu Asp Asp Lys He 

190 195 200 



243 



gac aag cgc ctg gtt tac ate caa gac ccc ttt ttc eta aca ggg gag 291 
Asp Lys Arg Leu Val Tyr He Gin Asp Pro Phe Phe Leu Thr Gly Glu 
75 80 85 90 



339 



387 



435 



483 



atg gtc cag aaa aaa gta aag get cga ggc tac cag ttt gac aaa gga 531 
Met Val Gin Lys Lys Val Lys Ala Arg Gly Tyr Gin Phe Asp Lys Gly 
155 160 165 170 



579 



627 



ate acc gtc cag tea gtt gac cag gtc gta tea cca age ctg gaa agt 675 
He Thr Val Gin Ser Val Asp Gin Val Val Ser Pro Ser Leu Glu Ser 
205 210 215 

aat gtc ttt agt att aac gac tat att tta age ggg caa age cag get 723 
Asn Val Phe Ser He Asn Asp Tyr He Leu Ser Gly Gin Ser Gin Ala 
220 225 230 

get ata egg gec ttt aat gac tta att caa caa aag gaa gag cca att 771 
Ala He Arg Ala Phe Asn Asp Leu He Gin Gin Lys Glu Glu Pro He 
235 240 245 250 
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aaa ate ate gec att atg atg aac caa ttc cgt tta tta ttg cag gtt 
Lys He He Ala He Met Met Asn Gin Phe Arg Leu Leu Leu Gin Val 

255 260 265 

aaa ata ttg egg act aag ggc tac caa caa gga gag ate get aaa ate 
Lys He Leu Arg Thr Lys Gly Tyr Gin Gin Gly Glu He Ala Lys He 

270 275 280 

tta aaa gtt cac ccc tac egg gtt aag eta gec ata gag aaa cag gag 
Leu Lys Val His Pro Tyr Arg Val Lys Leu Ala He Glu Lys Gin Glu 
285 290 295 

att ttt tec aag caa agt eta teg acc gee tac cgc tac tta att gag 
He Phe Ser Lys Gin Ser Leu Ser Thr Ala Tyr Arg Tyr Leu He Glu 
300 305 310 

tea gat cat ttg att aaa acg ggc aag gtg acc teg caa ttg caa ttt 
Ser Asp His Leu He Lys Thr Gly Lys Val Thr Ser Gin Leu Gin Phe 
315 320 325 330 

gaa ctt ttt gec eta caa ttt aaa gat tct gtc atg aat taa 
Glu Leu Phe Ala Leu Gin Phe Lys Asp Ser Val Met Asn 

335 340 



<210> 80 
<211> 343 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 80 

Met Lys He Glu Asp Gin Leu Lys Lys He Lys Asp Gin Asp Leu Ser 
15 10 15 

Pro Leu Tyr Leu Val Gin Gly Asp Asp Gin Tyr Leu Leu Asp Gin Val 

20 25 30 



Lys Lys Ser Leu Ser Gin Ala Leu Leu Asp Gin Asp Glu Ala Ser Met 
35 40 45 

Asn Phe Gly Gin Phe Asn Met Met Ala Asp Ser Leu Asp Met Ala Leu 
50 55 60 

Ser Asp Ala Glu Ser Tyr Pro Phe Phe Gly Asp Lys Arg Leu Val Tyr 
65 70 75 SO 



He Gin Asp Pro Phe Phe Leu Thr Gly Glu Lys Arg Lys Thr Asp Leu 

85 90 35 



819 



867 



915 



963 



1011 



1053 



Asp His Asp Leu Asp Arg Leu Leu Ala Tyr Leu Gin Asn Pro Ala Asp 

100 105 110 
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Phe Thr Val Leu Val Phe Phe Ala Pro Tyr Glu Lys Leu Asp Lys Arg 
115 120 125 

Lys Lys Val Thr Lys Ala Leu Leu Gin Glu Ala Glu lie He Asp Ala 
130 135 I 40 

Ser Ser Pro Asp Gin Arg Asp Leu Lys Asp Met Val Gin Lys Lys Val 
145 150 155 160 

Lys Ala Arg Gly Tyr Gin Phe Asp Lys Gly Ala Leu Lys Ala Leu Val 

165 170 175 

Glu Lys Thr Asn Ala Asn Leu Ser Arg Val Met Gin Glu Leu Asp Lys 

180 185 150 

Leu Phe Leu Tyr His Leu Asp Asp Lys He He Thr Val Gin Ser Val 
19 5 200 205 

Asp Gin Val Val Ser Pro Ser Leu Glu Ser Asn Val Phe Ser He Asn 
210 215 220 

Asp Tyx He Leu Ser Gly Gin Ser Gin Ala Ala He Arg Ala Phe Asn 
225 230 235 240 

Asp Leu He Gin Gin Lys Glu Glu Pro He Lys He He Ala He Met 

245 250 255 

Met Asn Gin Phe Arg Leu Leu Leu Gin Val Lys He Leu Arg Thr Lys 

260 265 270 

Gly Tyr Gin Gin Gly Glu He Ala Lys He Leu Lys Val His Pro Tyr 
2 75 280 285 

Arg Val Lys Leu Ala He Glu Lys Gin Glu He Phe Ser Lys Gin Ser 
290 295 300 

Leu Ser Thr Ala Tyr Arg Tyr Leu He Glu Ser Asp His Leu He Lys 
305 310 315 320 

Thr Gly Lys Val Thr Ser Gin Leu Gin Phe Glu Leu Phe Ala Leu Gin 

325 330 335 
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Phe Lys Asp Ser Val Met Asn 

340 



<210> 81 
<211> 477 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (1) . - (477) 

<223> 



atg aat cgc gca ate tat gca ggc agt ttt gat ccg att ace ctg ggc 

Me? Asn Arg Ala He Tyr Ala Gly Ser Phe Asp Pro He Thr Leu Gly 

cac ctg gat ate att aaa agg gec age cac tta ttc gat gaa gtc ate 

His Leu Asp He He Lys Arg Ala Ser His Leu Phe Asp Glu Val He 



25 30 

gtt gca gtt get aat aat aca teg aaa aat agt atg ttg aac ttt gac 
Val Ala Val Ala Asn Asn Thr Ser Lys Asn Ser Met Leu Asn Phe Asp 
35 40 45 

caa aaa ttg aac ctg gtt gaa caa tea att get age cag ggt eta get 
Gin Lys Leu Asn Leu Val Glu Gin Ser Xle Ala Ser Gin Gly Leu Ala 
50 55 60 

aat gtt caa gec aag aca tta gag tea ggc ttg att gtt gac ttt get 
Asn Val Gin Ala Lys Thr Leu Glu Ser Gly Leu He Val Asp Phe Ala 
65 70 75 80 

aag gac caa gga get agt agt ctg gtt agg ggg ttg egg teg gtt aaa 
Lys Asp Gin Gly Ala Ser Ser Leu Val Arg Gly Leu Arg Ser Val Lys 

85 90 95 

gac ttt gaa tat gag att gee att gag gac tta aat aag gtc caa gac 
Asp Phe Glu Tyr Glu He Ala He Glu Asp Leu Asn Lys Val Gin Asp 

100 105 HO 

cea get att gaa aea gtt tac eta gtc teg tct tec aaa tac egg tee 
Pro Ala He Glu Thr Val Tyr Leu Val Ser Ser Ser Lys Tyr Arg Ser 
115 120 125 

att tct tec tct att gtt egg gaa att att aag ttt aat ggc egg ctt 
He Ser Ser Ser He Val Arg Glu He He Lys Phe Asn Gly Arg Leu 
130 135 140 

gat gac eta gta cct gac ccc gtc gtc gaa tat ttt aaa aaa taa 
Asp Asp Leu Val Pro Asp Pro Val Val Glu Tyr Phe Lys Lys 
!45 150 155 



48 



96 



144 



192 



240 



288 



336 



384 



432 



477 
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<210> 82 
<211> 158 
<212> PRT 

<213> Alloiococcus otitidis 

M e t°L! 2 Arg Ala He Tyr Ala Gly Ser Phe Asp Pro He Thr Leu Gly 

His Leu Asp He He Lys Arg Ala Ser His Leu Phe Asp Glu Val He 

20 25 30 

Val Ala Val Ala Asn Asn Thx Ser Lys Asn Ser Met Leu Asn Phe Asp 
35 40 45 

Gin Lys Leu Asn Leu Val Glu Gin Ser He Ala Ser Gin' Gly Leu Ala 
50 55 60 

Asn Val Gin Ala Lys Thr Leu Glu Ser Gly Leu He Val Asp Phe Ala 
65 70 75 80 

Lys Asp Gin Gly Ala Ser Ser Leu Val Arg Gly Leu Arg Ser Val Lys 

85 9° 95 

Asp Phe Glu Tyr Glu He Ala He Glu Asp Leu Asn Lys Val Gin Asp 

100 105 HO 

Pro Ala He Glu Thr Val Tyr Leu Val Ser Ser Ser Lys Tyr Arg Ser 
115 120 125 

He Ser Ser Ser He Val Arg Glu He He Lys Phe Asn Gly Arg Leu 
130 135 14° 

Asp Asp Leu Val Pro Asp Pro Val Val Glu Tyr Phe Lys Lys 
145 150 155 



<210> 83 
<211> 1260 
<212> DNA 

<213> Alloiococcus otitidis 



<220> 

<221> CDS 

<222> (28) . . (1260) 

<223> 



<400> 83 
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ataaggattt caggaggaat actcata atg gat ttc aac tta gat aat aca gtt 

Met Asp Phe Asn Leu Asp Asn Thr Val 
1 5 

tea ggt ggc gca aag att aag gtt att ggt gtt ggc ggt get ggt ggc 
Ser Gly Gly Ala Lys lie Lys Val lie Gly Val Gly Gly Ala Gly Gly 
10 15 20 25 

aat gec gtt aac egg atg att gaa gat gga gtc gaa ggc gtt gaa ttt 
Asn Ala Val Asn Arg Met He Glu Asp Gly Val Glu Gly Val Glu Phe 

30 35 40 

att gta gec aat aca gat gtc caa gec ctt gat gee aac cga get gag 
He Val Ala Asn Thr Asp Val Gin Ala Leu Asp Ala Asn Arg Ala Glu 

45 50 55 

act aaa att caa etc gga gag aag tta acc agg gga etc ggt gec gga 
Thr Lys He Gin Leu Gly Glu Lys Leu Thr Arg Gly Leu Gly Ala Gly 
60 65 70 

get aat cca gaa gtt ggc cgt aag teg get gaa gag agt gaa gaa acc 
Ala Asn Pro Glu Val Gly Arg Lys Ser Ala Glu Glu Ser Glu Glu Thr 
75 80 85 

att gec gaa get ctt gaa gga get gac atg gtc ttc gtt act get ggt 
He Ala Glu Ala Leu Glu Gly Ala Asp Met Val Phe Val Thr Ala Gly 
90 95 100 105 

atg ggt ggc ggt act ggt act ggc ggg gcg ggc att att gec cgc att 
Met Gly Gly Gly Thr Gly Thr Gly Gly Ala Gly He He Ala Arg He 

110 115 120 

gec aaa gaa caa ggg get ttg act gta ggg gtt att acc egg ccg ttc 
Ala Lys Glu Gin Gly Ala Leu Thr Val Gly Val He Thr Arg Pro Phe 

125 130 . 135 

act ttt gaa gga cca aaa cgt ggg cgc ttt gca gec gaa ggg att gee 
Thr Phe Glu Gly Pro Lys Arg Gly Arg Phe Ala Ala Glu Gly He Ala 
140 145 150 

caa atg egg gaa cat gtt gac acc ctt gtc acc ate tec aac aac cgc 
Gin Met Arg Glu His Val Asp Thr Leu Val Thr He Ser Asn Asn Arg 
155 160 165 

ttg eta gaa att gtg gac aag aaa aca ccg atg atg gaa gee ttc aga 
Leu Leu Glu He Val Asp Lys Lys Thr Pro Met Met Glu Ala Phe Arg 
170 175 180 185 

gaa gca gat aat gtc etc cgc caa ggg gtt caa ggt ata tct gac ttg 
Glu Ala Asp Asn Val Leu Arg Gin Gly Val Gin Gly He Ser Asp Leu 

190 195 200 

att acc aat cca ggc tac gtc aac tta gac ttt gee gat gtc aaa acg 
He Thr Asn Pro Gly Tyr Val Asn Leu Asp Phe Ala Asp Val Lys Thr 

205 210 215 



54 



102 



150 



198 



246 



294 



342 



390 



438 



486 



534 



582 



630 



678 



gtg atg gec aac caa ggt tct gec ttg atg ggg att ggg tct get tea 726 
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Val Met Ala Asn Gin Gly Ser Ala Leu Met Gly He Gly Ser Ala Ser 
220 225 230 

ggt gag aat aga acg get gaa get act aag aaa get att tea tct cca 
Gly Glu Asn Arg Thr Ala Glu Ala Thr Lys Lys Ala He Ser Ser Pro 
235 240 245 

ctt ttg gaa gtc tec etc aat ggg get gaa aat gtc eta tta aac ata 
Leu Leu Glu Val Ser Leu Asn Gly Ala Glu Asn Val Leu Leu Asn lie 
250 255 260 265 

ace gga aac caa gac tta ace etc ttt gaa get caa gat get tct gat 
Thr Gly Asn Gin Asp Leu Thr Leu Phe Glu Ala Gin Asp Ala Ser Asp 

270 275 280 

ate gtc ggg get get get tet ggt gat gtt aat att ate ttc ggt act 
He Val Gly Ala Ala Ala Ser Gly Asp Val Asn He He Phe Gly Thr 

285 290 295 

tec ate aat gaa gac ctg gaa gat gag gtc ate gtt acc gtt att gca 
Ser He Asn Glu Asp Leu Glu Asp Glu Val He Val Thr Val He Ala 
300 305 310 

act ggt ate act ggt aaa gac atg ggc gag aaa tct tct aaa tec tea 
Thr Gly He Thr Gly Lys Asp Met Gly Glu Lys Ser Ser Lys Ser Ser 
315 320 325 

aac cgt age caa ggt cct agt caa aaa agt caa get cga tea get agt 
Asn Arg Ser Gin Gly Pro Ser Gin Lys Ser Gin Ala Arg Ser Ala Ser 
330 335 340 345 

gag tct age ttc tct age tgg caa aac caa tec aat gaa aga cca ggg 
Glu Ser Ser Phe Ser Ser Trp Gin Asn Gin Ser Asn Glu Arg Pro Gly 

350 355 360 

gaa gac caa gac cga cca age tct caa aga egg gaa gtc gat egg tec 
Glu Asp Gin Asp Arg Pro Ser Ser Gin Arg Arg Glu Val Asp Arg Ser 

365 370 375 

gaa aac ctg ttc aat gac gat agt aag gac cag cca gca gac tct ggt 
Glu Asn Leu Phe Asn Asp Asp Ser Lys Asp Gin Pro Ala Asp Ser Gly 
380 385 390 

gat gat gac gaa ttg gat acc cct cct ttc ttt aga cgt cgc cgc aag 
Asp Asp Asp Glu Leu Asp Thr Pro Pro Phe Phe Arg Arg Arg Arg Lys 
395 400 405 

aat tag 

Asn 

410 



774 



822 



870 



918 



966 



1014 



1062 



1110 



1158 



1206 



1254 



1260 



<210> '84 
<211> 410 
<212> PRT 

<213> Alloiococcus 



otitidis 
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<400> 84 _ „ 

Met Asp Phe Asn Leu Asp Asn Thr Val Ser Gly Gly Ala Lys lie Lys 

1 5 10 15 

Val He Gly Val Gly Gly Ala Gly Gly Asn Ala Val Asn Arg Met He 

20 25 30 

Glu Asp Gly Val Glu Gly Val Glu Phe He Val Ala Asn Thr Asp Val 
35 40 45 

Gin Ala Leu Asp Ala Asn Arg Ala Glu Thr Lys He Gin Leu Gly Glu 
50 • 55 60 

Lys Leu Thr Arg Gly Leu Gly Ala Gly Ala Asn Pro Glu Val Gly Arg 
65 70 75 80 

Lys Ser Ala Glu Glu Ser Glu Glu Thr He Ala Glu Ala Leu Glu Gly 

85 90 95 

Ala Asp Met Val Phe Val Thr Ala Gly Met Gly Gly Gly Thr Gly Thr 

100 105 HO 

Gly Gly Ala Gly He He Ala Arg He Ala Lys Glu Gin Gly Ala Leu 
115 120 125 

Thr Val Gly Val He Thr Arg Pro Phe Thr Phe Glu Gly Pro Lys Arg 
130 135 140 

Gly Arg Phe Ala Ala Glu Gly He Ala Gin Met Arg Glu His Val Asp 
145 150 155 160 

Thr Leu Val Thr He Ser Asn Asn Arg Leu Leu Glu He Val Asp Lys 

165 170 175 

Lys Thr Pro Met Met Glu Ala Phe Arg Glu Ala Asp Asn Val Leu Arg 

180 185 190 

Gin Gly Val Gin Gly lie Ser Asp Leu He Thr Asn Pro Gly Tyr Val 
195 200 205 

Asn Leu Asp Phe Ala Asp Val Lys Thr Val Met Ala Asn Gin Gly Ser 
210 215 220 
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Ala Leu Met Gly He Gly Ser Ala Ser Gly Glu Asn Arg Thr Ala Glu 
225 230 235 

Ala Thr Lys Lys Ala He Ser Ser Pro Leu Leu Glu Val Ser Leu Asn 



245 



250 



Gly Ala Glu Asn Val Leu Leu Asn He Thr Gly Asn Gin Asp Leu Thr 
Y 260 265 270 

i 

Phe Glu Ala Gin Asp Ala Ser Asp He Val Gly Ala Ala Ala Ser 



Leu 

275 



280 285 



Gly Asp Val Asn He lie Phe Gly Thr Ser He Asn Glu Asp Leu Glu 

295 300 



290 



Asp Glu Val He Val Thr Val He Ala Thr Gly He Thr Gly Lys Asp 
305 310 31 

Met Gly Glu Lys Ser Ser Lys Ser Ser Asn Arg Ser Gin Gly Pro Ser 

325 330 

Gin Lys Ser Gin Ala Arg Ser Ala Ser Glu Ser Ser Phe Ser Ser Trp 

345 350 



340 



Gin Asn Gin Ser Asn Glu Arg Pro Gly Glu Asp Gin Asp Arg Pro Ser 
355 3 60 365 

Ser Gin Arg Arg Glu Val Asp Arg Ser Glu Asn Leu Phe Asn Asp Asp 
370 3 *75 380 

Ser Lys Asp Gin Pro Ala Asp Ser Gly Asp Asp Asp Glu Leu Asp Thr 
385 390 395 

Pro Pro Phe Phe Arg Arg Arg Arg Lys Asn 

405 410 



<210> 85 
<211> 1377 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (13) . . (1377) 

<223> 
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acagataata ga atg ttt tta gat atg gag gtt tea atg aat atg aaa ut 51 
acagataata g ^ ^ ^ ^ ^ ^ ^ ^ Agn Met Lys ^ 



1 5 



ggg gtt tat aca age ctt gat att gga acc act tea ata aaa gta gtt 
Glv Val Tyr Thr Ser Leu Asp He Gly Thr Thr Ser lie Lys Val Val 
15 20 25 

„. r aCTt aaa gtt gat aat aat cag etc aaa gtt att gga gta gga aaa 
Val III Glu vll Lp Asn Asn Gin Leu Lys Val He Gly Val Gly Lys 
30 35 40 

get caa tea aaa ggt tta aaa agg ggc atg gtt gtc gat ata gat get 
Ala Gin Ser Lys Gly Leu Lys Arg Gly Met Val Val Asp He Asp Ala 

50 55 

af -r ate caa acc att cat act gca gtg aag cag get get gat aag act 
X£ Val Gin E S. His Thr Ala Val Lys Gin Ala Ala Asp Lys Thr 

65 70 75 

qqt gtt atg ate aac cag etc att gtt gga gtt cct get aat ggt gtt 
III Val Met He Asn Gin Leu He Val Gly Val Pro Ala Asn Gly Val 

85 90 



80 



agt att gaa ccc tgt cac ggg gtc att act gta gat gac egg tec aag 
Ser He Glu Pro Cys His Gly Val He Thr Val Asp Asp Arg Ser Lys 



95 



100 1° 5 



gaa ata gac age cag gaa gtg aae egg gta gtc aac cag tec att get 
Glu He Asp Ser Gin Glu Val Asn Arg Val Val Asn Gin Ser He Ala 

3_ 5 



110 



aat ate gtt ccg cca gat aga gac tta tta tec gtc agt tta gaa gaa 
Asn He Val Pro Pro Asp Arg Asp Leu Leu Ser Val Ser Leu Glu Glu 

130 135 

ttt att gta gat ggt ttt gat gaa att cat gat ccg aga ggc atg gtg 
Pne He Val Asp Gly Phe Asp Glu He His Asp Pro Arg Gly Met Val 

150 I 55 



145 



ggc cag egg tta gaa ctt tac ggg aca gca att tea gtg cct aaa aca 
G?y Gin £rg Leu Glu Leu Tyr Gly Thr Ala lie Ser Val Pro Lys Thr 
160 165 170 

att tta cat aac att aga cgt tgt gtt gaa aaa gcg ggc tat caa att 
lie Leu His Asn He Arg Arg Cys Val Glu Lys Ala Gly Tyr Gin He 
175 180 185 

get gec tta att etc cag ccc caa gee atg gee aag gta gee ttg tct 
La Ala Leu He Leu Gin Pro Gin Ala Met Ala Lys Val Ala Leu Ser 



190 



195 200 



gag gat gag egg aat ttt ggt aca gtt atg gtg gat ata ggc gga ggt 
Glu Lp Glu Arg Asn Phe Gly Thr Val Met Val Asp He Gly Gly Gly 



210 215 



99 



147 



195 



243 



291 



339 



387 



435 



483 



531 



579 



627 



675 
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caa acg acc eta tea gee att cac gat gag caa gtg aag tat gee aat 
Gin Thr Thr Leu Ser Ala He His Asp Glu Gin Val Lys Tyr Ala Asn 

225 230 235 

gtg gtc caa gaa gec gga gaa tat att acc aaa gac att tec att gtc 
Val Val Gin Glu Ala Gly Glu Tyr lie Thr Lys Asp He Ser He Val 
240 245 250 

ate aac acc tea cag caa aat gca gaa aag etc aaa aga gaa gtt ggg 
He Asn Thr Ser Gin Gin Asn Ala Glu Lys Leu Lys Arg Glu Val Gly 
255 260 265 

gec att aaa agt cag tct gat tea act gtt caa gta gat gtt gta ggt 
Ala He Lys Ser Gin Ser Asp Ser Thr Val Gin Val Asp Val Val Gly 
270 275 280 285 

caa aat gaa cct gtg aag att aaa gaa tec tat gtc ggt gaa att att 
Gin Asn Glu Pro Val Lys He Lys Glu Ser Tyr Val Gly Glu He He 

295 J0U 



290 



gaa gee egg gtt age caa ate ttt gaa aaa gtg aag get gac ctt gac 
Glu Ala Arg Val Ser Gin He Phe Glu Lys Val Lys Ala Asp Leu Asp 

305 310 315 

cca att aac gec ttc caa ttg cca ggt ggt gec gtt att tec ggc ggt 
Pro He Asn Ala Phe Gin Leu Pro Gly Gly Ala Val He Ser Gly Gly 
320 325 330 

tea get gec ata cca ggt att gac age ttg get gaa gac ate ttc aag 
Ser Ala Ala He Pro Gly He Asp Ser Leu Ala Glu Asp He Phe Lys 
335 340 345 



gtt egg tea gag etc tac att ccc gac tac atg ggt ate cga act ccc 
Val Arg Ser Glu Leu Tyr He Pro Asp Tyr Met Gly He Arg Thr Pro 
350 355 360 365 

gee ttc act gtg gca gtc ggc ttg acc etc tac caa gec cag act tct 
Ala Phe Thr Val Ala Val Gly Leu Thr Leu Tyr Gin Ala Gin Thr Ser 

370 375 380 

gat att gag egg gee ate aac cag tec ate ttg caa aat ate ggt att 
Asp He Glu Arg Ala He Asn Gin Ser He Leu Gin Asn He Gly He 

385 390 395 

aat cca gat age cag cct get aac egg ata gtt gac cag gat gat tea 
Asn Pro Asp Ser Gin Pro Ala Asn Arg He Val Asp Gin Asp Asp Ser 
400 405 410 

gtc caa agt cag gac caa aag acg caa gat gag cca gca gga gac caa 
Val Gin Ser Gin Asp Gin Lys Thr Gin Asp Glu Pro Ala Gly Asp Gin 
415 420 425 

get agt cag teg gat agt cca gaa gaa ggc aat ttt aca gac aga ate 
Ala Ser Gin Ser Asp Ser Pro Glu Glu Gly Asn Phe Thr Asp Arg He 
430 435 440 44 5 



723 



771 



819 



867 



915 



963 



1011 



1059 



1107 



1155 



1203 



1251 



1299 



1347 
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aag cat ttc ttt act aca ttt ttc gat taa 
Lys His Phe Phe Thr Thr Phe Phe Asp 

450 



<210> 86 
<211> 454 
<212> PRT 

<213> Alloiococcus otitidis 

Met°phe 6 Leu Asp Met Glu Val Ser Met Asn Met Lys Asn Gly Val Tyr 
X 5 10 15 

Thr Ser Leu Asp He Gly Thr Thr Ser He Lys Val Val Val Ser Glu 

20 25 30 

Val Asp Asn Asn Gin Leu Lys Val He Gly Val Gly Lys Ala Gin Ser 
35 40 45 

Lys Gly Leu Lys Arg Gly Met Val Val Asp He Asp Ala Thr Val Gin 
50 55 60 

Ala He His Thr Ala Val Lys Gin Ala Ala Asp Lys Thr Gly Val Met 
65 70 75 80 

He Asn Gin Leu He Val Gly Val Pro Ala Asn Gly Val Ser lie Glu 

85 ^0 95 

Pro Cys His Gly Val He Thr Val Asp Asp Arg Ser Lys Glu He Asp 

100 105 HO 

Ser Gin Glu Val Asn Arg Val Val Asn Gin Ser He Ala Asn He Val 
115 120 125 

Pro Pro Asp Arg Asp Leu Leu Ser Val Ser Leu Glu Glu Phe He Val 
130 135 140 

Asp Gly Phe Asp Glu He His Asp Pro Arg Gly Met Val Gly Gin Arg 
145 150 155 160 

Leu Glu Leu Tyr Gly Thr Ala He Ser Val Pro Lys Thr He Leu His 

165 170 175 

Asn He Arg Arg Cys Val Glu Lys Ala Gly Tyr Gin He Ala Ala Leu 

180 185 190 



1377 
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lie Leu Gin Pro Gin Ala Met Ala Lys Val Ala Leu Ser Glu Asp Glu 
195 200 205 

Ara Asn Phe Gly Thr Val Met Val Asp He Gly Gly Gly Gin Thr Thr 
210 215 220 

Leu Ser Ala He His Asp Glu Gin Val Lys Tyr Ala Asn Val Val Gin 
225 230 235 240 

Glu Ala Gly Glu Tyr He Thr Lys Asp He Ser He Val He Asn Thr 

245 250 255 

Ser Gin Gin Asn Ala Glu Lys Leu Lys Arg Glu Val Gly Ala He Lys 

260 265 270 

Ser Gin Ser Asp Ser Thr Val Gin Val Asp Val Val Gly Gin Asn Glu 
275 280 285 

Pro Val Lys He Lys Glu Ser Tyr Val Gly Glu He He Glu Ala Arg 
290 295 300 

Val Ser Gin He Phe Glu Lys Val Lys Ala Asp Leu Asp Pro He Asn 
305 310 315 320 

Ala Phe Gin Leu Pro Gly Gly Ala Val He Ser Gly Gly Ser Ala Ala 

325 330 335 

He Pro Gly He Asp Ser Leu Ala Glu Asp He Phe Lys Val Arg Ser 

340 345 350 

Glu Leu Tyr He Pro Asp Tyr Met Gly He Arg Thr Pro Ala Phe Thr 
355 360 365 

Val Ala Val Gly Leu Thr Leu Tyr Gin Ala Gin Thr Ser Asp He Glu 
370 375 380 

Arg Ala He Asn Gin Ser He Leu Gin Asn He Gly He Asn Pro Asp 
385 390 395 400 

Ser Gin Pro Ala Asn Arg He Val Asp Gin Asp Asp Ser Val Gin Ser 

405 - 410 415 
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Gin Asp Gin Lys Thr Gin Asp Glu Pro Ala Gly Asp Gin Ala Ser Gin 

420 425 430 

Ser Asp Ser Pro Glu Glu Gly Asn Phe Thr Asp Arg He Lys His Phe 
435 440 445 



Phe Thr Thr Phe Phe Asp 
450 



<210> 87 
<211> 1179 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (16) . . (1179) 

<223> 

<400> 87 

agcaaaggag caagt atg gaa act aaa aaa caa gca tta aaa gtt tta tta 

Met Glu Thr Lys Lys Gin Ala Leu Lys Val Leu Leu 
15 10 

tea ggc ggt gga aca ggt ggc cat ate tac cca gec ttg gec ctt get 
Ser Gly Gly Gly Thr Gly Gly His He Tyr Pro Ala Leu Ala Leu Ala 
15 20 25 

aag cac eta get age tta cac tea gat gtc gag ttt ttg tat gtt ggc 
Lys His Leu Ala Ser Leu His Ser Asp Val Glu Phe Leu Tyr Val Gly 
30 35 40 

act caa agg gga ttg gaa aat aaa ttg gtc ccc caa gca gga ctt gac 
Thr Gin Arg Gly Leu Glu Asn Lys Leu Val Pro Gin Ala Gly Leu Asp 
45 50 55 60 

ttt ate ccg ate aaa gta gaa gga ttt age egg aag ttt aac ttc aaa 
Phe He Pro He Lys Val Glu Gly Phe Ser Arg Lys Phe Asn Phe Lys 

65 70 75 

age att aaa tat aat act aaa agt ctg att tat ttt eta aag gee ctg 
Ser He Lys Tyr Asn Thr Lys Ser Leu He Tyr Phe Leu Lys Ala Leu 

80 85 90 

agt aag tct aag caa ate ate aaa gac ttt cag cca gat gtg gta ata 
Ser Lys Ser Lys Gin He He Lys Asp Phe Gin Pro Asp Val Val lie 
95 100 105 

ggg aca ggt ggt tat gtt tgt gec cct gtc ata tac cag gcg acc aag 
Gly Thr Gly Gly Tyr Val Cys Ala Pro Val He Tyr Gin Ala Thr Lys 
110 115 120 

tta ggc att cca agt etc att cac gaa caa aat agt gtc gec ggg gtg 



51 



99 



147 



195 



243 



291 



339 



387 



435 
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Leu Gly He Pro Ser Leu He His Glu Gin Asn Ser Val Ala Gly Val 
125 130 135 

acc aat aag ttt ttg get egg tac gta gac aag att gec eta agt ttc 
Thr Asn Lys Phe Leu Ala Arg Tyr Val Asp Lys He Ala Leu Ser Phe 

145 150 155 

cag gaa get gaa aaa tec ttt gee aag tat aag gat aag ctg gtt ttg 
Gin Glu Ala Glu Lys Ser Phe Ala Lys Tyr Lys Asp Lys Leu Val Leu 

160 165 170 

act ggt aat cca aga gga cag gaa gtc age caa gtc aag ggt ggc ctt 
Thr Gly Asn Pro Arg Gly Gin Glu Val Ser Gin Val Lys Gly Gly Leu 
175 180 185 

age etc cac aag tat ggc atg gac atg tec caa cct tea gta att att 
Ser Leu His Lys Tyr Gly Met Asp Met Ser Gin Pro Ser Val He He 
190 195 200 

ttt ggt ggg tea agg ggg get tat get att aat aag gee ttt gtt gag 
Phe Gly Gly Ser Arg Gly Ala Tyr Ala He Asn Lys Ala Phe Val Glu 
205 210 215 220 

gca tat agt caa ctg get gag agg gac tac cag gtc ttg ttt gtg ccg 
Ala Tyr Ser Gin Leu Ala Glu Arg Asp Tyr Gin Val Leu Phe Val Pro 

225 230 235 

gga tea get aat ttt age egg ata aaa cag gaa att gat aac cgc tat 
Gly ser Ala Asn Phe Ser Arg He Lys Gin Glu He Asp Asn Arg Tyr 

240 245 250 

ggc cag cat aag ccg tea aac att ttt att gaa tec tat ate gat aac 
Gly Gin His Lys Pro Ser Asn He Phe He Glu Ser Tyr He Asp Asn 
255 260 265 

atg ccc caa gtt ttt aag get att gac ttg gtg gtt tgc cgt agt ggg 
Met Pro Gin Val Phe Lys Ala He Asp Leu Val Val Cys Arg Ser Gly 
270 275 280 

gec act acc eta gee gaa att atg tea tta ggc ttg gec age att tta 
Ala Thr Thr Leu Ala Glu He Met Ser Leu Gly Leu Ala Ser He Leu 
285 290 295 300 

att cca agt ccc aat gta acg get gac cac caa acc aaa aat get atg 
He Pro Ser Pro Asn Val Thr Ala Asp His Gin Thr Lys Asn Ala Met 

305 310 315 

agt ttg gtt aac caa caa get ggc tta atg att aag gaa aat gat eta 
Ser Leu Val Asn Gin Gin Ala Gly Leu Met He Lys Glu Asn Asp Leu 

320 325 330 

aat ggc caa age etc tta aac tgc tta gat gac ctg atg cat gat gac 
Asn Gly Gin Ser Leu Leu Asn Cys Leu Asp Asp Leu Met His Asp Asp 
335 340 345 

gca aaa aga aac aag atg gec caa caa gcg aaa gaa atg ggc caa ccc 
Ala Lys Arg Asn Lys Met Ala Gin Gin Ala Lys Glu Met Gly Gin Pro 



483 



531 



579 



627 



675 



723 



771 



819 



867 



915 



963 



1011 



1059 



1107 
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350 355 360 

caa get tea gac aag ttg ate get etc ate ttg tec atg gtt aag gaa 

Gin Ala Ser Asp Lys Leu lie Ala Leu lie Leu Ser Met Val Lys Glu 

365 370 375 380 



1155 



gat att aac tea gac ate gat taa 
Asp lie Asn Ser Asp lie Asp 

385 



<210> 88 
<211> 387 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 88 

Met Glu Thr Lys Lys Gin Ala Leu Lys Val Leu Leu Ser Gly Gly Gly 
15 10 15 

Thr Gly Gly His lie Tyr Pro Ala Leu Ala Leu Ala Lys His Leu Ala 

20 25 30 

Ser Leu His Ser Asp Val Glu Phe Leu Tyr Val Gly Thr Gin Arg Gly 
35 40 45 



Leu Glu Asn Lys Leu Val Pro Gin Ala Gly Leu Asp Phe lie Pro lie 
50 55 60 



Lys Val Glu Gly Phe Ser Arg Lys Phe Asn Phe Lys Ser He Lys Tyr 
65 70 75 80 



Asn Thr Lys Ser Leu He Tyr Phe Leu Lys Ala Leu Ser Lys Ser Lys 

85 90 95 



Gin He He Lys Asp Phe Gin Pro Asp Val Val He Gly Thr Gly Gly 

100 105 110 



Tyr Val Cys Ala Pro Val He Tyr Gin Ala Thr Lys Leu Gly He Pro 
115 120 125 



Ser Leu He His Glu Gin Asn Ser Val Ala Gly Val Thr Asn Lys Phe 
130 135 140 



1179 



Leu Ala Arg Tyr Val Asp Lys He Ala Leu Ser Phe Gin Glu Ala Glu 
145 150 155 160 
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Lys Ser Phe Ala Lys Tyr Lys Asp Lys Leu val Leu Thr Gly Asn Pro 

165 170 

Arq Gly Gin Glu Val Ser Gin Val Lys Gly Gly Leu Ser Leu His Lys 

180 185 19° 

Tyr Gly Met Aso Met Ser Gin Pro Ser Val lie lie Phe Gly Gly Ser 
195 " 200 205 

Arg Gly Ala Tyr Ala lie Asn Lys Ala Phe Val Glu Ala Tyr Ser Gin 
210 215 220 

Leu Ala Glu Arg Asp Tyr Gin Val Leu Phe Val Pro Gly Ser Ala Asn 
225 230 2 

Phe Ser Arg He Lys Gin Glu He Asp Asn Arg Tyr Gly Gin His Lys 

245 250 255 

Pro Ser Asn He Phe He Glu Ser Tyr He Asp Asn Met Pro Gin Val 

260 265 270 

Phe Lys Ala He Asp Leu Val Val Cys Arg Ser Gly Ala Thr Thr Leu 
275 280 • 285 

Ala Glu He Met Ser Leu Gly Leu Ala Ser He Leu He Pro Ser Pro 
290 295 300 



Asn Val Thr Ala Asp His Gin Thr Lys Asn Ala Met Ser Leu Val Asn 
305 310 315 320 



Gin Gin Ala Gly Leu Met He Lys Glu Asn Asp Leu Asn Gly Gin Ser 

325 330 335 

Leu Leu Asn Cys Leu Asp Asp Leu Met His Asp Asp Ala Lys Arg Asn 

340 345 350 

Lys Met Ala Gin Gin Ala Lys Glu Met Gly Gin Pro Gin Ala Ser Asp 
355 360 365 

Lys Leu He Ala Leu He Leu Ser Met Val Lys Glu Asp He Asn Ser 
370 375 380 



Asp lie Asp 
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385 



<210> 89 
<2ll> 1428 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (25).. (1428) 

<223> 

cg^acttag aaaggtttgg caag atg gtt gat teg gta ttt tgc aat aag 51 
y Met Val Asp Ser Val Phe Cys Asn Lys 

1 5 



aag gtt tta gtt tta ggc ttg gca aaa age ggg etc agt gcg gee cat 
LyI Val Leu Val Leu Gly Leu Ala Lys Ser Gly Leu Ser Ala Ala His 
10 15 20 25 

ttg tta aaa aaa eta ggg gec aag gtc ate gtc aat gac aag ttg gec 
Leu Leu Lys Lys Leu Gly Ala Lys Val lie Val Asn Asp Lys Leu Ala 

30 35 

eta gaa aat aat acg gaa gec cag gtc tta att gaa gag ggc ttc caa 
Leu Glu Asn Asn Thr Glu Ala Gin Val Leu He Glu Glu Gly Phe Gin 

45 50 55 

gtt ate ace ggc tac cac cca gag gat tta ctt gat gca age ttt gac 
Val lie Thr Gly Tyr His Pro Glu Asp Leu Leu Asp Ala Ser Phe Asp 
60 65 70 

ttt gtc gtc aag aat ccg ggc att cct tac acc aat cca gtg gta ggc 
Phe Val Val Lys Asn Pro Gly He Pro Tyr Thr Asn Pro Val Val Gly 
75 80 85 

cag get gaa aaa ctg get att ccc att tta act gaa gtg gac gtg gca 
Gin Ala Glu Lys Leu Ala He Pro He Leu Thr Glu Val Asp Val Ala 
90 95 100 105 

gga age ate tta aaa gec aag ccc ate get gtt ace ggg ace aat ggc 
Gly Ser lie Leu Lys Ala Lys Pro He Ala Val Thr Gly Thr Asn Gly 

110 H5 I 20 

aag aca act acc gta tct tta att tat gat att tta gec caa gat caa 
Lys Thr Thr Thr Val Ser Leu He Tyr Asp He Leu Ala Gin Asp Gin 

125 130 135 

gcg gaa age cct gaa cct aaa cca gtc tac aag eta ggc aat att ggc 
Ala Glu Ser Pro Glu Pro Lys Pro Val Tyr Lys Leu Gly Asn He Gly 
140 "5 150 

caa ccg gtt agt gac ttg gec tta gaa att aaa get gaa tct aac ctg 
Gin Pro Val Ser Asp Leu Ala Leu Glu He Lys Ala Glu Ser Asn Leu 
155 I 60 165 



99 



147 



195 



243 



29.1 



339 



387 



435 



483 



531 



WO 03/104391 



200/235 



PCT/US02/36122 



gtt gtc gaa etc tct agt ttc caa eta cag tea ctg acc tat ttc ace 
Val Val Glu Leu Ser Ser Phe Gin Leu Gin Ser Leu Thr Tyr Phe Thr 
170 175 180 185 

cct cat ata gca gtc att acc aat att tat tec gee cac ctt gac tac 
Pro His lie Ala Val He Thr Asn He Tyr Ser Ala His Leu Asp Tyr 

190 195 200 

cat aag agt egg gag gaa tat gtt agg get aag eta agg att acc cag 
His Lys Ser Arg Glu Glu Tyr Val Arg Ala Lys Leu Arg He Thr Gin 

205 210 215 

get caa ggt ccg gat gac tac eta gtc tac tac cag ggt cag gaa gaa 
Ala Gin Gly Pro Asp Asp Tyr Leu Val Tyr Tyr Gin Gly Gin Glu Glu 
220 225 230 

ttg get age ctg gtc aaa aaa tac tct aaa gec cag ctg gtc ccc tat 
Leu Ala Ser Leu Val Lys Lys Tyr Ser Lys Ala Gin Leu Val Pro Tyr 
235 240 245 

act gac aag ggt caa ctg aac caa gga gee tat ate aag gat gac tat 
Thr Asp Lys Gly Gin Leu Asn Gin Gly Ala Tyr He Lys Asp Asp Tyr 
250 255 260 265 

ctt ate tat aat caa gag cca gtc atg get tta gac cga gtt caa gtt 
Leu He Tyr Asn Gin Glu Pro Val Met Ala Leu Asp Arg Val Gin Val 

270 275 280 



579 



aaa ata aag ggg etc tct aac caa acc att gec caa get gtc aac cac 
Lys He Lys Gly Leu Ser Asn Gin Thr He Ala Gin Ala Val Asn His 
300 305 310 



egg ctt ttt gtc aac gac tct aag gca acc aat age ttg gee aca cag 
Arg Leu Phe Val Asn Asp Ser Lys Ala Thr Asn Ser Leu Ala Thr Gin 
330 335 340 345 

aag gca tta gaa gee tat gac caa gat acc ate ttg tta gtg ggt ggc 
Lys Ala Leu Glu Ala Tyr Asp Gin Asp Thr He Leu Leu Val Gly Gly 

350 355 360 



gtt aag ggg gtc gtt tgt ttt ggc cag acc aaa gat aag tta gee egg 
Val Lys Gly Val Val Cys Phe Gly Gin Thr Lys Asp Lys Leu Ala Arg 
380 385 390 



627 



675 



723 



771 



819 



867 



tct ggt age cac aac tta caa aat att tta gca get gtt tgc gta get 915 
Ser Gly Ser His Asn Leu Gin Asn He Leu Ala Ala Val Cys Val Ala 

285 290 295 



963 



ttc aaa ggg gtt gee cac cgc age cag gtg gtt ggg egg tat gag gac 1011 
Phe Lys Gly Val Ala His Arg Ser Gin Val Val Gly Arg Tyr Glu Asp 
315 320 325 



1059 



1107 



eta gac cgc caa gat gat ttt tec aag ctt gac cat get eta aac agg 1155 
Leu Asp Arg Gin Asp Asp Phe Ser Lys Leu Asp His Ala Leu Asn Arg 

365 370 375 



1203 
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tat ttt aaa gac cgt cac att gag ggt gtt gag ctt gee cag aca gtt 
Tyr Phe Lys Asp Arg His He Glu Gly Val Glu Leu Ala Gin Thr Val 
395 400 405 



cct aaa aca gtt gat ttg get tac gac ttg agt gag cca gga caa gtc 
Pro Glu pll Val Asp Leu Ala Tyr Asp Leu Ser Glu Pro Gly Gin Val 
410 420 

at-i- tta ttt tct cct get tgt gca agt tgg gac caa tat get aac ttt 
£. llu Phe Ser Pro Ala Cys Ala Ser Trp Asp Gin Tyr Ala Asn Phe 

430 435 



aaa aaa aga gga caa gat tat gtt gat gca ate cag cag ctg gtt gaa 
Glu Glu S oly Gin Lp Tyr Val Asp Ala lie Gin Gin Leu Val Glu 

445 450 45b 



aga eta gag caa agg age aag tat gga aac taa 
Arg Leu Glu Gin Arg Ser Lys Tyr Gly Asn 
460 465 



<210> 90 
<211> 467 
<212> PRT 

<213> Alloiocdccus otitidis 



M e t°vai°Asp Ser Val Phe Cys Asn Lys Lys Val Leu Val Leu Gly Leu 

c; 10 -*-=> 



1299 



1347 



1395 



1428 



Ala Lys Ser Gly Leu Ser Ala Ala His Leu Leu Lys Lys Leu Gly Ala 

20 25 30 



Lys 



Val lie Val Asn Asp Lys Leu Ala Leu Glu Asn Asn Thr Glu Ala 



35 



40 



45 



Gin Val Leu lie Glu Glu Gly Phe 
50 55 

Glu Asp Leu Leu Asp Ala Ser Phe 
65 70 

He Pro Tyr Thr Asn Pro Val Val 

85 



Gin Val He Thr Gly Tyr His Pro 

60 



Asp Phe Val Val Lys Asn Pro Gly 
75 80 



Gly Gin Ala Glu Lys Leu Ala He 
90 * 5 



Pro He Leu Thr Glu Val Asp Val Ala Gly Ser He Leu Lys Ala Lys 

100 



105 HO 



Pro He Ala Val Thr Gly Thr Asn Gly Lys Thr Thr Thr Val Ser Leu 
115 120 125 
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He Tyr Asp He Leu Ala Gin Asp Gin Ala Glu Ser Pro Glu Pro Lys 
130 135 14° 

Pro Val Tyr Lys Leu Gly Asn He Gly Gin Pro Val Ser Asp Leu Ala 

^L- ^3 

Leu Glu He Lys Ala Glu Ser Asn Leu Val Val Glu Leu Ser Ser Phe 

165 170 I 75 

Gin Leu Gin Ser Leu Thr Tyr Phe Thr Pro His He Ala Val He Thr 

180 185 I 90 

Asn He Tyr Ser Ala His Leu Asp Tyr His Lys Ser Arg Glu Glu Tyr 
195 200 205 

Val Arg Ala Lys Leu Arg He Thr Gin Ala Gin Gly Pro Asp Asp Tyr 
210 215 220 

Leu Val Tyr Tyr Gin Gly Gin Glu Glu Leu Ala Ser Leu Val Lys Lys 
225 230 235 

Tvr Ser Lys Ala Gin Leu Val Pro Tyr Thr Asp Lys Gly Gin Leu Asn 

245 250 255 

Gin Gly Ala Tyr He Lys Asp Asp Tyr Leu He Tyr Asn Gin Glu Pro 

260 265 270 

Val Met Ala Leu Asp Arg Val Gin Val Ser Gly Ser His Asn Leu Gin 
275 280 285 

Asn He Leu Ala Ala Val Cys Val Ala Lys He Lys Gly Leu Ser Asn 
290 295 300 

Gin Thr He Ala Gin Ala Val Asn His Phe Lys Gly Val Ala His Arg 
305 310 315 320 

Ser Gin Val Val Gly Arg Tyr Glu Asp Arg Leu Phe Val Asn Asp Ser 

325 330 335 

Lys Ala Thr Asn Ser Leu Ala Thr Gin Lys Ala Leu Glu Ala Tyr Asp 

340 345 350 
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Gin Asp Thr lie Leu Leu Val Gly Gly Leu Asp Arg Gin Asp Asp Phe 
355 360 365 

Ser Lys Leu Asp His Ala Leu Asn Arg Val Lys Gly Val Val Cys Phe 
370 375 380 

Gly Gin Thr Lys Asp Lys Leu Ala Arg Tyr Phe Lys Asp Arg His lie 
385 390 395 400 

Glu Gly Val Glu Leu Ala Gin Thr Val Pro Glu Ala Val Asp Leu Ala 

405 410 415 

Tyr Asp Leu Ser Glu Pro Gly Gin Val He Leu Phe Ser Pro Ala Cys 

420 425 430 

Ala Ser Trp Asp Gin Tyr Ala Asn Phe Glu Glu Arg Gly Gin Asp Tyr 
435 440 445 

Val Asp Ala lie Gin Gin Leu Val Glu Arg Leu Glu Gin Arg Ser Lys 
450 455 460 



Tyr Gly Asn 
465 



<210> 91 
<211> 651 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (7) . - (651) 

<223> 

<400> 91 

actagt atg aag caa aaa act caa gcg aca gcg gtc aac cag acc caa 

Met Lys Gin Lys Thr Gin Ala Thr Ala Val Asn Gin Thr Gin 
1 5 10 

aca gag gca gaa gaa aga caa gaa acc cgt egg aaa att ggc etc atg 
Thr Glu Ala Glu Glu Arg Gin Glu Thr Arg Arg Lys He Gly Leu Met 
15 20 25 30 

ggg ggg acc ttt aat ccg ccc cat ctg ggt cat tta ctg gta get gaa 
Gly Gly Thr Phe Asn Pro Pro His Leu Gly His Leu Leu Val Ala Glu 

35 40 45 

caa gtt tat gag gec ttg gac ttg gat aat att cac ttt atg ccc act 



48 



96 



144 



192 
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Gin Val Tyr Glu Ala Leu Asp Leu Asp Asn Xle His Phe Met Pro Thr 

50 55 60 

gca aag ccg ggc cat gcc get ggt aag gaa acc ata gat gec tct tac 
Ala Lys Pro Gly His Ala Ala Gly Lys Glu Thr lie Asp Ala Ser Tyr 
65 70 75 

egg gtt gat atg gtg gat tat gcc ate gaa gat aac ccc cac ttt tct 
Arg Val Asp Met Val Asp Tyr Ala lie Glu Asp Asn Pro His Phe Ser 
80 65 9? 

ctt aac ttg act gaa gtg aac egg gga ggg aca act tac acc ate gat 
Leu Asn Leu Thr Glu Val Asn Arg Gly Gly Thr Thx Tyr Thr lie Asp 
95 100 105 HO 

acc att aaa gaa ttg aaa gag get age ccg aat aca gat tat tac ttc 
Thr He Lys Glu Leu Lys Glu Ala Ser Pro Asn Thr Asp Tyr Tyr Phe 

115 120 125 

att att ggt gag gat tea gtt atg gat ttg gcc cag tgg aag aat att 
He He Gly Glu Asp Ser Val Met Asp Leu Ala Gin Trp Lys Asn He 

130 135 140 

gaa caa tta ctg gat tta gtt caa ttt gtt ggt gtg aag cga cca ggc 
Glu Gin Leu Leu Asp Leu Val Gin Phe Val Gly Val Lys Arg Pro Gly 
145 150 155 

tac caa get gat gtg gac ttt ccc att att tgg gtg gat acg cca gaa 
Tyr Gin Ala Asp Val Asp Phe Pro He He Trp Val Asp Thr Pro Glu 
160 165 170 

eta gat att agt tea agt gac ate agg caa agg gtg gca. gaa ggg caa 
Leu Asp He Ser Ser Ser Asp He Arg Gin Arg Val Ala Glu Gly Gin 
175 180 185 190 

tec att aaa tat ttg acc cca gat agg gta aga gat tat att gaa gac 
Ser He Lys Tyr Leu Thr Pro Asp Arg Val Arg Asp Tyr He Glu Asp 

195 200 205 

-aat ggc tta tat aag ggt gaa gaa taa 
Asn Gly Leu Tyr Lys Gly Glu Glu 

210 



240 



288 



336 



384 



432 



480 



528 



576 



624 



651 



<210> 92 
<211> 214 
<212> PRT 

<213> Alloiococcus otitidis 

<400> 92 ^ _ 

Met Lys Gin Lys Thr Gin Ala Thr Ala Val Asn Gin Thr Gin Thr Glu 
i q 10 15 



Ala Glu Glu Arg Gin Glu Thr Arg Arg Lys He Gly Leu Met Gly Gly 

20 25 30 
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Thr Phe Asn Pro Pro His Leu Gly His Leu Leu Val Ala Glu Gin Val 
35 40 45 

Tyr Glu Ala Leu Asp Leu Asp Asn He His Phe Met Pro Thr Ala Lys 
50 55 60 

Pro Gly His Ala Ala Gly Lys Glu Thr He Asp Ala Ser Tyr Arg Val 
65 70 75 80 

Asn Met Val Asp Tyr Ala He Glu Asp Asn Pro His Phe Ser Leu Asn 

85 90 95 

Leu Thr Glu Val Asn Arg Gly Gly Thr Thr Tyr Thr He Asp Thr He 

100 105 11° 

Lys Glu Leu Lys Glu Ala Ser Pro Asn Thr Asp Tyr Tyr Phe He He 
115 120 125 

Gly Glu Asp Ser Val Met Asp Leu Ala Gin Trp Lys Asn He Glu Gin 
130 135 140 

Leu Leu Asp Leu Val Gin Phe Val Gly Val Lys Arg Pro Gly Tyr Gin 
145 150 155 160 

Ala Asp Val Asp Phe Pro He He Trp Val Asp Thr Pro Glu Leu Asp 

165 170 175 

He Ser Ser Ser Asp He Arg Gin Arg Val Ala Glu Gly Gin Ser He 

180 185 190 

Lys Tyr Leu Thr Pro Asp Arg Val Arg Asp Tyr lie Glu Asp Asn Gly 
195 200 205 



Leu Tyr Lys Gly Glu Glu 
210 



<210> 93 
<211> 666 
<212> DMA 

<213> Alloiococcus otitidis 



<220> 

<221> CDS 

<222> (1) . . (666) 



WO 03/104391 



206/235 



PCT/US02/36122 



<223> 



<400> 93 

atg gta ggg gga etc att ttt gtc etc act gec age aat aaa agg aaa 

Met Val Gly Gly Leu He Phe Val Leu Thr Ala Ser Asn Lys Arg Lys 

1 5 10 15 

gga agt ttg tec atg acc tat ttg tta ggc eta ace ggt ggc att gee 
Gly Ser Leu Ser Met Thr Tyr Leu Leu Gly Leu Ttir Gly Gly He Ala 

20 25 30 

agt ggg aag tct act gtt age cag gtt ttt aag gaa aag ggt ate caa 
Ser Gly Lys Ser Thr Val Ser Gin Val Phe Lys Glu Lys Gly He Gin 
35 40 45 

gtg gtt gat get gac cga gtt gee cga cag gtt gtt gaa cct gga agt 
Val Val Asp Ala Asp Arg Val Ala Arg Gin Val Val Glu Pro Gly Ser 
50 55 60 



cca ggc tta gac cag ctt gtt gat tat ttt ggc cag gag att ttg acc 

Pro Gly Leu Asp Gin Leu Val Asp Tyr Phe Gly Gin Glu He Leu Thr 

*7n 7 S 80 

65 70 /D 



cag gat ggg ggc ttg gac cgc aaa tat tta ggc gac ctt ate ttc egg 
Gin Asp Gly Gly Leu Asp Arg Lys Tyr Leu Gly Asp Leu He Phe Arg 

85 90 95 

aat age cag gec aag gag get gtc aac egg ate etc cac cct ttg att 
Asn Ser Gin Ala Lys Glu Ala Val Asn Arg He Leu Hxs Pro Leu He 

100 105 11° 

agg cag tct ate caa aat caa att aaa act gee ata ggc caa gac ttg 
Arg Gin Ser He Gin Asn Gin He Lys Thr Ala He Gly Gin Asp Leu 
115 120 125 

gat ttg tta gtt tta gac ate ccc etc ctt tac gag aca ggt cag gca 
Asp Leu Leu Val Leu Asp He Pro Leu Leu Tyr Glu Thr Gly Gin Ala 
130 135 140 

gac gac tac cag gee gtc atg gtg gtt teg ctt ccc tac cag gac cag 
Asp Asp Tyr Gin Ala Val Met Val Val Ser Leu Pro Tyr Gin Asp Gin 
145 150 155 160 

gtg agt egg tta atg gac egg gat ggg att gac cga gac caa gee ctg 
Val Ser Arg Leu Met Asp Arg Asp Gly He Asp Arg Asp Gin Ala Leu 

165 170 175 

cgc aag att cag gec caa atg tea ttg gaa gaa aaa gtg aag ttg gcg 
Arg Lys He Gin Ala Gin Met Ser Leu Glu Glu Lys Val Lys Leu Ala 

180 185 190 

gac tat gtc att gat aac age gga age aag gaa gaa age cgt cag cag 
Asp Tyr Val He Asp Asn Ser Gly Ser Lys Glu Glu Ser Arg Gin Gin 
195 200 205 

gtt gaa get tgg ttg gat caa aag ggt ttt aaa aac ttg taa 
Val Glu Ala Trp Leu Asp Gin Lys Gly Phe Lys Asn Leu 



48 



96 



144 



19 



n 



240 



288 



336 



384 



432 



480 



528 



576 



624 



666 
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210 215 



220 



<210> 94 
<211> 221 
<212> PRT 

<213> Alloiococcus ofcitidis 

Met°val 4 Gly Gly Leu lie Phe Val Leu Thr Ala Ser Asn Lys Arg Lys 
1 5 10 « 

Gly Ser Leu Ser Met Thr Tyr Leu Leu Gly Leu Thr Gly Gly He Ala 

20 25 3° 

Ser Gly Lys Ser Thr Val Ser Gin Val Phe Lys Glu Lys Gly He Gin 
35 4° 45 

Val Val Asp Ala Asp Arg Val Ala Arg Gin Val Val Glu Pro Gly Ser 
50 55 60 

Pro Gly Leu Asp Gin Leu Val Asp Tyr Phe Gly Gin Glu He Leu Thr 
65 70 75 80 

Gin Asp Gly Gly Leu Asp Arg Lys Tyr Leu Gly Asp Leu lie Phe Arg 

85 90 95 

Asn Ser Gin Ala Lys Glu Ala Val Asn Arg He Leu His Pro Leu He 

100 105 HO 

Arg Gin Ser He Gin Asn Gin He Lys Thr Ala He Gly Gin Asp Leu 
115 120 125 

Asp Leu Leu Val Leu Asp He Pro Leu Leu Tyr Glu Thr Gly Gin Ala 
130 135 I 40 

Asp Asp Tyr Gin Ala Val Met Val Val Ser Leu Pro Tyr Gin Asp Gin 
145 150 155 160 

Val Ser Arg Leu Met Asp Arg Asp Gly He Asp Arg Asp Gin Ala Leu 

165 170 I 75 

Arg Lys He Gin Ala Gin Met Ser Leu Glu Glu Lys Val Lys Leu Ala 

.180 185 19° 



WO 03/104391 



208/235 



PCT/US02/36122 



Asp Tyr Val He Asp Asn Ser Gly Ser Lys Glu Glu Ser Arg Gin Gin 
195 200 205 

Val Glu Ala Trp Leu Asp Gin Lys Gly Phe Lys Asn Leu 
210 215 220 



<210> 95 
<211> 1335 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (4).. (1335) 

<223> 

gQt°atg 5 gac caa gac acc ate tat cac ttt gtt ggc att aaa gga tct 48 
Me? Lp Gin Asp Thr He Tyr His Phe Val Gly He Lys Gly Ser 



15 10 15 

ggc atg agt tea ctt gec act ate ttg ttt gac aag ggc tta aat gtc 
Gly Met Ser Ser Leu Ala Thr He Leu Phe Asp Lys Gly Leu Asn Val 

20 25 30 

eaa gga tct gat gtc aaa aag tat ttc ttt acc caa aaa age tta gaa 
Gin Gly Ser Asp Val Lys Lys Tyr Phe Phe Thr Gin Lys Ser Leu Glu 

35 40 45 

gaa aaa aat ata aac att tta gaa ttt gae ect gat aac ate aaa cca 
Glu Lys Asn He Asn He Leu Glu Phe Asp Pro Asp Asn He Lys Pro 
50 55 60 

ggt atg acc ctg ata gca ggc aat gec ttt gga gac aac cat ccc gag 
Gly Met Thr Leu He Ala Gly Asn Ala Phe Gly Asp Asn Hxs Pro Glu 
65 70 75 

ctg gtc cga ggt cga gag etc ggt tta gaa ate ate cgc tac cat .gat 
Leu Val Arg Gly Arg Glu Leu Gly Leu Glu He He Arg Tyr Hxs Asp 
80 85 90 95 

ttt ate ggt gae ctt ate gaa cac ttt act tee ate get att acc ggg 
Phe He Gly Asp Leu He Glu His Phe Thr Ser He Ala He Thr Gly 

100 105 11° 



tct cac ggt aag acc tec aca act ggt ttg atg gec cat gtt ttc tec 
Ser His Gly Lys Thr Ser Thr Thr Gly Leu Met Ala His Val Phe Ser 

115 120 125 

ggt att gat age acc tec tac tta att gga gat ggg acc ggc cat ggg 
Gly He Lp Ser Thr Ser Tyr Leu He Gly Asp Gly Thr Gly His Gly 
130 135 140 



gaa aaa ggt gec aag tat ttt gtc ttg gaa gee tge gaa tac aag egg 
Glu Lys Gly Ala Lys Tyr Phe Val Leu Glu Ala Cys Glu Tyr Lys Arg 



96 



144 



192 



240 



288 



336 



364 



432 



480 
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145 



150 155 



cac ttt ttg gcc tac cga ccg gac tat gcg gtt atg acc aat att gac 
His HI Leu La Tyr Arg Pro Asp Tyr Ala Val Met Thr Asn lie Asp 
160 165 170 

ttt gac cac ccg gac tat tac aag tct att gaa gat gtc caa gtg gcc 
Phe Asp His Pro Asp Tyr Tyr Lys Ser lie Glu Asp Val Gin Val Ala 

ttt gat gaa ttc age cac cag gtc aaa aaa tac etc ttt gcc tgc ggg 
Phe Asp Glu Phe Ser His Gin Val Lys Lys Tyr Leu Phe Ala Cys Gly 

195 200 205 

gac gac caa cgt ctt egg cag gtc aaa gcc cag gtg ccg gtc att tac 
Asp Asp Gin Arg Leu Axg Gin Val Lys Ala Gin Val Pro Val He Tyr 
210 215 220 

tac ggt eta aat gaa gac aat gac ttt gtg get aaa aac ate gac cga 
Tyr Gly Leu Asn Glu Asp Asn Asp Phe Val Ala Lys Asn He Asp Arg 
225 230 235 

agt cgt gaa ggg tct gcc ttc gac ctt tat att aag gga gaa ttt tac 
Ser Arg Glu Gly Ser Ala Phe Asp Leu Tyr lie Lys Gly Glu Pne Tyr 
240 245 250 



aaa cac ttc acc ate eca acc tat ggc aac cac aat att caa aat gcc 
Lys His Phe Thr He Pro Thr Tyr Gly Asn His Asn He Gin Asn Ala 
Y 260 265 270 

ttg gcg gtt ata gca gta get tac tac gaa ggg tta gac caa gat ttg 
Leu All Val He Ala Val Ala Tyr Tyr Glu Gly Leu Asp Gin Asp Leu 

275 280 285 

att gcc caa aga ttg get aat ttt get ggg gtg aaa cgc egg ttt acc 
£l Ala Gin Arg Leu Ala Asn Phe Ala Gly Val Lys Arg Arg Phe Thr 



290 



295 3°° 



gag aag gtg gtc ggg gac act act att ate gat gac tat get cac cac 
Glu Lys Val Val Gly Asp Thr Thr He He Asp Asp Tyr Ala Hxs His 
305 31° 315 



cct get gaa ata agg gca acg att gat gcg gcc egg caa aaa tac ccg 
III III Glu He Arg Ala Thr He Asp Ala Ala Arg Gin Lys Tyr Pro 
320 325 330 335 

gac aag gac att gtg acg gtc ttc cag ccc cac acc ttt acc egg aca 
Asp Lys Lp He Val Thr Val Phe Gin Pro His Thr Phe Thr Arg Thr 

340 345 ->= u 



gtc gcc etc eta gat gaa ttt gcc cag gcc ttg gac ttg gca gac cag 
Val Ala Leu Leu Asp Glu Phe Ala Gin Ala Leu Asp Leu Ala Asp Gin 

355 360 365 

ctt tac ttg tgt gat ate ttt aat tea get aga gaa aag tea ggc gat 
?S T^r Leu SS ASP He Phe Asn Ser Ala Arg Glu Lys Ser Gly Asp 

375 380 



370 
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528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 
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att tec ate caa gat ctt ttg get aaa acc age aag gee gac cag gtg 
lie Ser He Gin Asp Leu Leu Ala Lys Thr Ser Lys Ala Asp Gin Val 
385 390 395 

att gag gaa gac gat gtg tct cct ctg ctt gac caa cat ggg caa gtg 
He Glu Glu Asp Asp Val Ser Pro Leu Leu Asp Gin His Gly Gin Val 
400 405 410 415 

att att ttc atg gga gca gga gac ate age aag ttt gaa aaa gec tat 
He lie Phe Met Gly Ala Gly Asp He Ser Lys Phe Glu Lys Ala Tyr 

420 425 430 

gaa age ttg ttg age tea acc tac cac tec cag gtc taa 
Glu Ser Leu Leu Ser Ser Thr Tyr His Ser Gin Val 

435 440 



<210> 96 
<211> 443 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 96 

Met Asp Gin Asp Thr He Tyr His Phe Val Gly He Lys Gly Ser Gly 
15 10 15 

Met Ser Ser Leu Ala Thr He Leu Phe Asp Lys Gly Leu Asn Val Gin 

20 25 30 

Gly Ser Asp Val Lys Lys Tyr Phe Phe Thr Gin Lys Ser Leu Glu Glu 
35 40 45 

Lys Asn He Asn He Leu Glu Phe Asp Pro Asp Asn He Lys Pro Gly 
50 55 60 



Met Thr Leu He Ala Gly Asn Ala Phe Gly Asp Asn His Pro Glu Leu 
65 70 75 80 

Val Arg Gly Arg Glu Leu Gly Leu Glu He He Arg Tyr His Asp Phe 

85 90 95 



He Gly Asp Leu He Glu His Phe Thr Ser He Ala He Thr Gly Ser 

100 105 HO 

His Gly Lys Thr Ser Thr Thr Gly Leu Met Ala His Val Phe Ser Gly 
115 120 125 

He Asp Ser Thr Ser Tyr Leu He Gly Asp Gly Thr Gly His Gly Glu 



1200 



1248 



1296 



1335 
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130 



135 I 40 



Lys Gly Ala Lys Tyr Phe Val Leu Glu Ala Cys Glu Tyr Lys Arg His 
145 150 155 

Phe Leu Ala Tyr Arg Pro Asp Tyr Ala Val Met Thr Asn lie Asp Phe 

165 170 175 

Asp His Pro Asp Tyr Tyr Lys Ser He Glu Asp Val Gin Val Ala Phe 

180 185 190 

Asp Glu Phe Ser His Gin Val Lys Lys Tyr Leu Phe Ala Cys Gly Asp 
195 200 205 

Asp Gin Arg Leu Arg Gin Val Lys Ala Gin Val Pro Val He Tyr Tyr 
■ 210 215 220 

Gly Leu Asn Glu Asp Asn Asp Phe Val Ala Lys Asn lie Asp Arg Ser 
225 230 235 

Arg Glu Gly Ser Ala Phe Asp Leu Tyr He Lys Gly Glu Phe Tyr Lys 

245 250 255 

His Phe Thr lie Pro Thr Tyr Gly Asn His Asn Xle Gin Asn Ala Leu 

260 265 270 

Ala Val He Ala Val Ala Tyr Tyr Glu Gly Leu Asp Gin Asp Leu Val 
275 280 285 

Ala Gin Arg Leu Ala Asn Phe Ala Gly Val Lys Arg Arg Phe Thr Glu 
290 295 300 

Lvs Val Val Gly Asp Thr Thr He He Asp Asp Tyr Ala His His Pro 
305 310 315 320 

Ala Glu He Arg Ala Thr He Asp Ala Ala Arg Gin Lys Tyr Pro Asp 

325 330 335 

Lys Asp He Val Thr Val Phe Gin Pro His Thr Phe Thr Arg Thr Val 

340 345 350 

Ala Leu Leu Asp Glu Phe Ala Gin Ala Leu Asp Leu Ala Asp Gin Val 
355 360 365 
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Tyr Leu Cys Asp He Phe Asn Ser Ala Arg Glu Lys Ser Gly Asp He 
370 375 380 

Ser He Gin Asp Leu Leu Ala Lys Thr Ser Lys Ala Asp Gin Val He 
385 390 395 400 

Glu Glu Asp Asp Val Ser Pro Leu Leu Asp Gin His Gly Gin Val He 

405 410 415 



He Piie Met Gly Ala Gly Asp He Ser Lys Phe Glu Lys Ala Tyr Glu 

420 425 430 



Ser Leu Leu Ser Ser Thr Tyr His Ser Gin Val 
435 440 



<210> 97 
<211> 1050 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (19) . . (1050) 

<223> 

<400> 97 

acaaaattat ttacgtgt atg gag gaa tta ata gtg cca tta tta gac tta 

Met Glu Glu Leu He Val Pro Leu Leu Asp Leu 
1 5 10 

aat gac cat gac cgc gtt cag gaa tat gag gac ttt gtc caa aac cac 
Asn Asp His Asp Arg Val Gin Glu Tyr Glu Asp Phe Val Gin Asn His 

15 20 25 

ccc cag ggc cac ctg atg cag tct acc aaa tgg ate cag gtt aag gaa 
Pro Gin Gly His Leu Met Gin Ser Thr Lys Trp He Gin Val Lys Glu 
30 35 40 



aag gca tgc ttg tec att eta tea gtc aaa aat gac gga gaa cat gec 
Lys Ala Cys Leu Ser He Leu Ser Val Lys Asn Asp Gly Glu His Ala 
60 65 70 75 

ttc tta tat gcg cca aga ggg ccg gtt tgt gac ttt cat gat aca gac 
Phe Leu Tyr Ala Pro Arg Gly Pro Val Cys Asp Phe His Asp Thr Asp 

80 85 90 



51 



99 



147 



ggc tgg gac ggt gac tat gtt tac ctt acc gat gac caa gac egg ate 195 
Gly Trp Asp Gly Asp Tyr Val Tyr Leu Thr Asp Asp Gin Asp Arg He 
45 50 55 



243 



291 
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ttg gtg acc gac tta att aag gaa gcc caa gtc gta gcg gac aag cac 33 9 

Leu Val Thr Asp Leu lie Lys Glu Ala Gin Val Val Ala Asp Lys His 

95 100 105 

aag gcc ttt ttg ttg egg atg gac ccg gaa acc ctt cat gat cct gac 387 
Lys Ala Phe Leu Leu Arg Met Asp Pro Glu Thr Leu His Asp Pro Asp 
110 115 120 

ctg gtc gaa aaa tac cgc gat tta ggc tat act ttc egg tea get gag 435 
Leu Val Glu Lys Tyr Arg Asp Leu Gly Tyr Thr Phe Arg Ser Ala Glu 
125 130 135 

caa gaa gat gaa cac gtc ttc tec aac ccc cgc ttc cac atg atg acg 483 
Gin Glu Asp Glu His Val Phe Ser Asn Pro Arg Phe His Met Met Thr 
140 145 150 155 

gac tta agg ggt cat gat gaa gaa age ttg ctg atg gcc ttc acc age 531 
Asp Leu Arg Gly His Asp Glu Glu Ser Leu Leu Met Ala Phe Thr Ser 

160 165 170 

aat aac egg cgc aag ate cgc aaa act tac aaa aat aac etc cag acc 579 
Asn Asn Arg Arg Lys lie Arg Lys Thr Tyr Lys Asn Asn Leu Gin Thr 

175 180 185 

cac tat ctg acc gtg gat gat gag ggt tat gac cag gcc ttg gat gac 627 
His Tyr Leu Thr Val Asp Asp Glu Gly Tyr Asp Gin Ala Leu Asp Asp 
190 195 200 

ttt tat gaa ttg acc caa ata atg gca gaa egg caa ggg att act cac 675 
Phe Tyr Glu Leu Thr Gin He Met Ala Glu Arg Gin Gly He Thr His 
205 . 210 215 

egg ccc aaa gac tac ttt gac egg tta atg cac age ttt gag gat get 723 
Arg Pro Lys Asp Tyr Phe Asp Arg Leu Met His Ser Phe Glu Asp Ala 
220 225 230 t 235 

aaa ttg ttc cag acc tac cac gaa gat gac etc eta get act tgt ate 771 
Lys Leu Phe Gin Thr Tyr His Glu Asp Asp Leu Leu Ala Thr Cys He 

240 245 250 

ttg gtg age tat aat aaa aaa tec ttc tac atg tat gca get tct tec 819 
Leu Val Ser Tyr Asn Lys Lys Ser Phe Tyr Met Tyr Ala Ala Ser Ser 

255 260 265 

aac aaa aaa cga aat tta aat ggg tct ttg caa gaa aat tac gaa gcc 867 
Asn Lys Lys Arg Asn Leu Asn Gly Ser Leu Gin Glu Asn Tyr Glu Ala 
270 275 280 

atg aag tat gcc ttg gcc cga gga age gaa gaa tat gat atg ggt ggg 915 
Met Lys Tyr Ala Leu Ala Arg Gly Ser Glu Glu Tyr Asp Met Gly Gly 
285 290 295 

gtc ttt ggc ttt gac aag teg gac ggc etc tac egg ttt aaa aaa ate 963 
Val Phe Gly Phe Asp Lys Ser Asp Gly Leu Tyr Arg Phe Lys Lys He 
300 305 310 315 



ttt acc ggt cat gaa ggg ctg aaa gaa ttt atg ggt gaa ttg gat gtg 



1011 
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Phe Thr Gly His Glu Gly Leu Lys Glu Phe Met Gly Glu Leu Asp Val 

320 325 330 

gtc tat gac caa gac eta tac gac gat ttt att tct taa 
Val Tyr Asp Gin Asp Leu Tyr Asp Asp Phe lie Ser 

335 " 340 



<210> 98 
<211> 343 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 98 

Met Glu Glu Leu He Val Pro Leu Leu Asp Leu Asn Asp His Asp Arg 
15 10 15 



Val Gin Glu Tyr Glu Asp Phe Val Gin Asn His Pro Gin Gly His Leu 

20 25 30 



Met Gin Ser Thr Lys Trp He Gin Val Lys Glu Gly Trp Asp Gly Asp 
35 40 45 



Tyr Val Tyr Leu Thr Asp Asp Gin Asp Arg He Lys Ala Cys Leu Ser 
50 55 60 



He Leu Ser Val Lys Asn Asp Gly Glu His Ala Phe Leu Tyr Ala Pro 
65 70 75 80 

Arg Gly Pro Val Cys Asp Phe His Asp Thr Asp Leu Val Thr Asp Leu 

85 90 95 



He Lys Glu Ala Gin Val Val Ala Asp Lys His Lys Ala Phe Leu Leu 

100 105 110 - 



Arg Met Asp Pro Glu Thr Leu His Asp Pro Asp Leu Val Glu Lys Tyr 
115 120 125 

Arg Asp Leu Gly Tyr Thr Phe Arg Ser Ala Glu Gin Glu Asp Glu His 
130 135 140 

Val Phe Ser Asn Pro Arg Phe His Met Met Thr Asp Leu Arg Gly His 
145 150 155 160 



1050 



Asp Glu Glu Ser Leu Leu Met Ala Phe Thr Ser Asn Asn Arg Arg Lys 

165 170 175 
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lie Arg Lys Thr Tyr Lys Asn Asn Leu Gin Thr His Tyr Leu Thr Val 

180 185 190 



Asp Asp Glu Gly Tyr Asp Gin Ala Leu Asp Asp Phe Tyr Glu Leu Thr 
195 200 205 



Gin lie Met Ala Glu Arg Gin Gly lie Thr His Arg Pro Lys Asp Tyr 
210 215 220 



Phe Asp Arg Leu Met His Ser Phe Glu Asp Ala Lys Leu Phe Gin Thr 
225 230 235 240 



Tyr His Glu Asp Asp Leu Leu Ala Thr Cys lie Leu Val Ser Tyr Asn 

245 250 255 



Lys Lys Ser Phe Tyr Met Tyr Ala Ala Ser Ser Asn Lys Lys Arg Asn 

260 265 270 



Leu Asn Gly Ser Leu Gin Glu Asn Tyr Glu Ala Met Lys Tyr Ala Leu 
275 280 285 



Ala Arg Gly Ser Glu Glu Tyr Asp Met Gly Gly Val Phe Gly Phe Asp 
290 295 300 



Lys Ser Asp Gly Leu Tyr Arg Phe Lys Lys lie Phe Thr Gly His Glu 
305 310 315 320 



Gly Leu Lys Glu Phe Met Gly Glu Leu Asp Val Val Tyr Asp Gin Asp 

325 330 335 



Leu Tyr Asp Asp Phe lie Ser 

340 



<210> 99 
<211> 2244 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CBS 

<222> (22) . . (2244) 

<223> 

<400> 99 

ttacgtgaaa ggaagacttg c atg ggc eta gca aaa gat att tta ggc aaa 51 
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Met Gly Leu Ala Lys Asp lie Leu Gly Lys 
15 10 

atg aat gac aaa caa aaa caa gcg gtc atg acc act gat ggc cct etc 99 
Met Asn Asp Lys Gin Lys Gin Ala Val Met Thr Thr Asp Gly Pro Leu 

15 20 25 

ttg ate atg get ggg gca gga tct ggc aag acc egg gtc tta acc cac 147 
Leu lie Met Ala Gly Ala Gly Ser Gly Lys Thr Arg Val Leu Thr His 

30 35 40 

egg ata get tac ttg ate caa gaa aaa ggg gtt aat cct tgg aat ate 195 
Arg lie Ala Tyr Leu lie Gin Glu Lys Gly Val Asn Pro Trp Asn lie 
45 50 55 

tta gec ate acc ttt acc aac aag gcg get ggc gag atg aaa gac egg 243 
Leu Ala lie Thr Phe Thr Asn Lys Ala Ala Gly Glu Met Lys Asp Arg 
60 65 70 

gtc cag aaa ctg gtt age cag gga gga tct gga gtt tgg gtc teg act 291 
Val Gin Lys Leu Val Ser Gin Gly Gly Ser Gly Val Trp Val Ser Thr 
75 80 85 90 

ttc cac tct atg tgt gtt cgc att eta aga agg gac ggg gac caa att 339 
Phe His Ser Met Cys Val Arg lie Leu Arg Arg Asp Gly Asp Gin lie 

" 95 100 105 

ggc tat aac cgt gec ttc acc att get gac cct agt gaa cag aaa agt 387 
Gly Tyr Asn Arg Ala Phe Thr lie Ala Asp Pro Ser Glu Gin Lys Ser 

110 115 120 

ttg atg aag cag gtc tta aaa gac ttg aat att gat cct aaa cgt tac 435 
Leu Met Lys Gin Val Leu Lys Asp Leu Asn lie Asp Pro Lys Arg Tyr 
125 130 135 

aac ccc aag gcg ata ttg gee gag att tec aat gec aaa aat gac etc 483 
Asn Pro Lys Ala lie Leu Ala Glu lie Ser Asn Ala Lys Asn Asp Leu 
140 145 150 

ttg gat gag caa acc tac egg aaa caa get gat gac tat ttt aag gaa 531 
Leu Asp Glu Gin Thr Tyr Arg Lys Gin Ala Asp Asp Tyr Phe Lys Glu 
155 160 165 170 

gtg gtg get gac tgc tac gat get tac caa aga cag etc cgc cag tct 579 
Val Val Ala Asp Cys Tyr Asp Ala Tyr Gin Arg Gin Leu Arg Gin Ser 

175 180 185 

gag gec atg gac ttt gac gac ctg att atg caa acc gtc cgt etc ttc 627 
Glu Ala Met Asp Phe Asp Asp Leu lie Met Gin Thr Val Arg Leu Phe 

190 195 200 

aag gaa aag ccc gat acc ctg tct tac tac cag gec aag ttc cag tat 675 
Lys Glu Lys Pro Asp Thr Leu Ser Tyr Tyr Gin Ala Lys Phe Gin Tyr 
205 210 215 



ate cat gtt gac gaa tac cag gat acc aac caa gec caa tac caa ctg 
lie His Val Asp Glu Tyr Gin Asp Thr Asn Gin Ala Gin Tyr Gin Leu 



723 
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220 225 230 

gtt caa ctg eta gec caa cgc ttt aaa aat gtt tgc gtc gtg gga gat 771 
Val Gin Leu Leu Ala Gin Arg Phe Lys Asn Val Cys Val Val Gly Asp 
235 240 245 250 

get gac cag tct att tat ggt tgg egg ggg get gat atg gga aat att 819 
Ala Asp Gin Ser He Tyr Gly Trp Arg Gly Ala Asp Met Gly Asn lie 

255 260 265 

ttg aat ttc gaa aaa gac tat cca gaa gee caa ace ate ttt ttg gaa 867 
Leu Asn Phe Glu Lys Asp Tyr Pro Glu Ala Gin Thr He Phe Leu Glu 

270 275 280 

caa aat tac egg tea acc aag tct ata ate agg gca gee aat gat gtt 915 
Gin Asn Tyr Arg Ser Thr Lys Ser He lie Arg Ala Ala Asn Asp Val 
285 290 295 

ate caa aac aat ate aac cgc egg gac aag aat ttg tgg act gec aac 963 
He Gin Asn Asn He Asn Arg Arg Asp Lys Asn Leu Trp Thr Ala Asn 
300 305 310 • 

gat gag ggg gac aag gtc age tta tac get gec egg age gag cag gat 1011 
Asp Glu Gly Asp Lys Val Ser Leu Tyr Ala Ala Arg Ser Glu Gin Asp 
315 320 325 330 

gaa gec cag ttt ate gta ggg acc ate cat gac eta aca gaa ggc aaa 1059 
Glu Ala Gin Phe He Val Gly Thr He His Asp Leu Thr Glu Gly Lys 

335 340 345 

aag get ggc tat ggg gac ate gec ate etc tac egg acc aat gee atg 1107 
Lys Ala Gly Tyr Gly Asp He Ala He Leu Tyr Arg Thr Asn Ala Met 

350 355 360 

tec egg gtt att gaa gaa acc ttt ate aag teg aat ate ccc tac aag 1155 
Ser Arg Val He Glu Glu Thr Phe He Lys Ser Asn He Pro Tyr Lys 
365 370 375 

ate gtc ggc gga acc ggc ttt tac caa aga aaa gaa ate cgt gac ctg . 1203 
He Val Gly Gly Thr Gly Phe Tyr Gin Arg Lys Glu He Arg Asp Leu 
380 385 390 

att gee tac eta acc eta gtg get aac cca get gat gac ctg tec ttt 1251 
He Ala Tyr Leu Thr Leu Val Ala Asn Pro Ala Asp Asp Leu Ser Phe 
395 400 405 410 

tea egg ate gtt aat gag ccc aaa aga ggg att gga ccc ggc acc ctg 1299 
Ser Arg He Val Asn Glu Pro Lys Arg Gly He Gly Pro Gly Thr Leu 

415 420 425 

gac aag tta cgc cag get ggc cag gag atg ggt tgg teg ctt tac gaa 1347 
Asp Lys Leu Arg Gin Ala Gly Gin Glu Met Gly Trp Ser Leu Tyr Glu 

430 435 440 

aca get etc aat gcg gat get acc aac ctg cct agt egg get gtc aac 1395 
Thr Ala Leu Asn Ala Asp Ala Thr Asn Leu Pro Ser Arg Ala Val Asn 
445 450 455 
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aga eta tta gac ttc agt caa atg att gaa aat ttc agg aaa atg acg 1443 
Arg Leu Leu Asp Phe Ser Gin Met lie Glu Asn Phe Arg Lys Met Thr 
460 465 470 

gaa tac tta ccg att act gat ttg acc gaa aaa ate tta gag gat act 1491 
Glu Tyr Leu Pro lie Thr Asp Leu Thr Glu Lys lie Leu Glu Asp Thr 
475 480 485 490 

ggc tac caa aaa gec tta gaa aaa gac egg act ctt gaa tct cag gca 1539 
Gly Tyr Gin Lys Ala Leu Glu Lys Asp Arg Thr Leu Glu Ser Gin Ala 

495 500 505 

agg tta gag aac eta cag gaa ttt tac tec gtc acc cag gaa ttt gac 1587 
Arg Leu Glu Asn Leu Gin Glu Phe Tyr Ser Val Thr Gin Glu Phe Asp 

510 515 520 

cag caa gaa gac gac aac aag tea etc tta gee ttc tta act gac ctt 163 5 

Gin Gin Glu Asp Asp Asn Lys Ser Leu Leu Ala Phe Leu Thr Asp Leu 
525 530 535 

tec tta ttg tea cca get gat gat gtt gaa gag ggt egg ggc cag gtc 1683 
Ser Leu Leu Ser Pro Ala Asp Asp Val Glu Glu Gly Arg Gly Gin Val 
540 545 550 

acc atg atg acc etc cat gca gec aag ggg ttg gaa ttc ccc tat gtc 1731 
Thr Met Met Thr Leu His Ala Ala Lys Gly Leu Glu Phe Pro Tyr Val 
555 560 565 570 

ttt ate get ggt atg gaa gag gga ate ttc ccc ttg tec egg gcg get 1779 
Phe lie Ala Gly Met Glu Glu Gly lie Phe Pro Leu Ser Arg Ala Ala 

575 580 585 

gaa gac ccg gaa age ttg gaa gaa gag cga cga ctg gee tat gta ggg 1827 
Glu Asp Pro Glu Ser Leu Glu Glu Glu Arg Arg Leu Ala Tyr Val Gly 

590 595 600 

att acc egg get gag cag gec etc tac eta acc cgt gec atg atg cgc 1875 
lie Thr Arg- Ala Glu Gin Ala Leu Tyr Leu Thr Arg Ala Met Met Arg 
605 610 615 

caa etc tat ggc egg acc cag get aat ccc aaa tct cgc ttt tta tct 1923 
Gin Leu Tyr Gly Arg Thr Gin Ala Asn Pro Lys Ser Arg Phe Leu Ser 
620 625 630 

gaa att tct tct gac ctg gtc caa gac ctt ggt get aca act ggg tct 1971 
Glu He Ser Ser Asp Leu Val Gin Asp Leu Gly Ala Thr Thr Gly Ser 
635 640 645 650 

ctt age cag act ggg ggg aaa gtt age cct aga eta gga ggc cgc aaa 2019 
Leu Ser Gin Thr Gly Gly Lys Val Ser Pro Arg Leu Gly Gly Arg Lys 

655 660 665 

gec agt ggt tat aag get aat get tgg tct cag caa tea gtt ggg gcg 2 067 

Ala Ser Gly Tyr Lys Ala Asn Ala Trp Ser Gin Gin Ser Val Gly Ala 

670 675 680 
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act ggg get gaa aaa gaa gac tgg gaa gtt ggt gac aag gtc cac cac 2115 
Thr Gly Ala Glu Lys Glu Asp Trp Glu Val Gly Asp Lys Val His His 
685 690 695 

aaa aaa tgg ggc caa gga acc att att gag att aaa ggt tct ggc teg 2163 
Lys Lys Trp Gly Gin Gly Thr He He Glu lie Lys Gly Ser Gly Ser 
700 705 710 



gac etc cag etc aac att gee ttt cca gat gaa ggg ate aag ccc ttg 

Asp Leu Gin Leu Asn He Ala Pfae Pro Asp Glu Gly He Lys Pro Leu 

715 720 725 730 

eta gec agt ttt gee ccc ate gaa aag att tag 
Leu Ala Ser Phe Ala Pro He Glu Lys lie 

735 740 



<210> 100 
<211> 740 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 100 

Met Gly Leu Ala Lys Asp He Leu Gly Lys Met Asn Asp Lys Gin Lys 
1 5 " 10 15 



Gin Ala Val Met Thr Thr Asp Gly Pro Leu Leu He Met Ala Gly Ala 

20 25 30 



Gly Ser Gly Lys Thr Arg Val Leu Thr His Arg He Ala Tyr Leu He 
35 40 45 



Gin Glu Lys Gly Val Asn Pro Trp Asn He Leu Ala He Thr Phe Thr 
50 55 60 



Asn Lys Ala Ala Gly Glu Met Lys Asp Arg Val Gin Lys Leu Val Ser 
65 70 75 80 



Gin Gly Gly Ser Gly Val Trp Val Ser Thr Phe His Ser Met Cys Val 

85 90 95 



Arg He Leu Arg Arg Asp Gly Asp Gin He Gly Tyr Asn Arg Ala Phe 

100 105 110 



Thr He Ala Asp Pro Ser Glu Gin Lys Ser Leu Met Lys Gin Val Leu 
115 120 125 



2211 



2244 



Lys Asp Leu Asn He Asp Pro Lys Arg Tyr Asn Pro Lys Ala He Leu 
130 135 140 
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Ala Glu lie Ser Asn Ala Lys Asn Asp Leu Leu Asp Glu Gin Thr Tyr 
145 150 155 160 



Arg Lys Gin Ala Asp Asp Tyr Phe Lys Glu Val Val Ala Asp Cys Tyr 

165 170 175 



Asp Ala Tyr Gin Arg Gin Leu Arg Gin Ser Glu Ala Met Asp Phe Asp 

180 185 190 



Asp Leu lie Met Gin Thr Val Arg Leu Phe Lys Glu Lys Pro Asp Thr 
195 200 205 



Leu Ser Tyr Tyr Gin Ala Lys Phe Gin Tyr lie His Val Asp Glu Tyr 
210 215 220 



Gin Asp Thr Asn Gin Ala Gin Tyr Gin Leu Val Gin Leu Leu Ala Gin 
225 230 235 240 



Arg Phe Lys Asn Val Cys Val Val Gly Asp Ala Asp Gin Ser He Tyr 

245 250 255 



Gly Trp Arg Gly Ala Asp Met Gly Asn He Leu Asn Phe Glu Lys Asp 

260 265 270 



Tyr Pro Glu Ala Gin Thr He Phe Leu Glu Gin Asn Tyr Arg Ser Thr 
275 280 285 



Lys Ser He He Arg Ala Ala Asn Asp Val He Gin Asn Asn lie Asn 
290 295 300 



Arg Arg Asp Lys Asn Leu Trp Thr Ala Asn Asp Glu Gly Asp Lys Val 
305 310 315 320 



Ser Leu Tyr Ala Ala Arg Ser Glu Gin Asp Glu Ala Gin Phe He Val 

325 330 335 



Gly Thr He His Asp Leu Thr Glu Gly Lys Lys Ala Gly Tyr Gly Asp 

340 345 350 



He Ala He Leu Tyr Arg Thr Asn Ala Met Ser Arg Val He Glu Glu 
355 360 365 
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Thr Phe He Lys Ser Asn He Pro Tyr Lys He Val Gly Gly Thr Gly 
370 375 380 



Phe Tyr Gin Arg Lys Glu He Arg Asp Leu He Ala Tyr Leu Thr Leu 
385 390 395 400 



Val Ala Asn Pro Ala Asp Asp Leu Ser Phe Ser Arg He Val Asn Glu 

405 410 415 



Pro Lys Arg Gly He Gly Pro Gly Thr Leu Asp Lys Leu Arg Gin Ala 

420 425 430 

Gly Gin Glu Met Gly Trp Ser Leu Tyr Glu Thr Ala Leu Asn Ala Asp 
435 440 445 



Ala Thr Asn Leu Pro Ser Arg Ala Val Asn Arg Leu Leu Asp Phe Ser 
450 455 460 



Gin Met He Glu Asn Phe Arg Lys Met Thr Glu Tyr Leu Pro He Thr 
465 470 475 480 



Asp Leu Thr Glu Lys He Leu Glu Asp Thr Gly Tyr Gin Lys Ala Leu 

485 490 495 



Glu Lys Asp Arg Thr Leu Glu Ser Gin Ala Arg Leu Glu Asn Leu Gin 

500 505 510 



Glu Phe Tyr Ser Val Thr Gin Glu Phe Asp Gin Gin Glu Asp Asp Asn 
515 520 525 



Lys Ser Leu Leu Ala Phe Leu Thr Asp Leu Ser Leu Leu Ser Pro Ala 
530 535 540 



Asp Asp Val Glu Glu Gly Arg Gly Gin Val Thr Met Met Thr Leu His 
545 550 555 560 



Ala Ala Lys Gly Leu Glu Phe Pro Tyr Val Phe He Ala Gly Met Glu 

565 570 575 



Glu Gly He Phe Pro Leu Ser Arg Ala Ala Glu Asp Pro Glu Ser Leu 

580 585 590 
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Glu Glu Glu Arg Arg Leu Ala Tyr Val Gly lie Thr Arg Ala Glu Gin 
595 600 605 



Ala Leu Tyr Leu Thr Arg Ala Met Met Arg Gin Leu Tyr Gly Arg Thr 
610 615 620 



Gin Ala Asn Pro Lys Ser Arg Phe Leu Ser Glu lie Ser Ser Asp Leu 
625 630 635 640 



Val Gin Asp Leu Gly Ala Thr Thr Gly Ser Leu Ser Gin Thr Gly Gly 

645 650 655 



Lys Val Ser Pro Arg Leu Gly Gly Arg Lys Ala Ser Gly Tyr Lys Ala 

660 665 670 



Asn Ala Trp Ser Gin Gin Ser Val Gly Ala Thr Gly Ala Glu Lys Glu 
675 680 685 



Asp Trp Glu Val Gly Asp Lys Val His His Lys Lys Trp Gly Gin Gly 
690 695 700 



Thr He He Glu He Lys Gly Ser Gly Ser Asp Leu Gin Leu Asn He 
705 710 715 720 



Ala Phe Pro Asp Glu Gly He Lys Pro Leu Leu Ala Ser Phe Ala Pro 

725 730 735 



He Glu Lys He 

740 



<210> 101 
<211> 1314 
<212> DMA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (4) . . (1314) 

<223> 

<400> 101 

agt atg gac aca ate gtc att caa gga gga gac aat cga ctt gag ggt 48 

Met Asp Thr He Val He Gin Gly Gly Asp Asn Arg Leu Glu Gly 
1 5 10 15 



aca gtc aag gta gaa ggg get aag aat get gec ctt cct ate ctg get 
Thr Val Lys Val Glu Gly Ala Lys Asn Ala Ala Leu Pro He Leu Ala 



96 



WO 03/104391 



223/235 



PCT/US02/36122 



20 25 30 

gcc agt ctt tta cca gaa gat ggg aaa agt cac ctg tec aat gtc ccc 144 

Ala Ser Leu Leu Pro Glu Asp Gly Lys Ser His Leu Ser Asn Val Pro 

35 40 45 

tta eta tct gat att tac acg atg caa gaa gtt ttg cgt tac tta aac 192 

Leu Leu Ser Asp lie Tyr Thr Met Gin Glu Val Leu Arg Tyr Leu Asn 

50 55 60 

gtt gac att gac ttc gat gaa gac cac aac gaa ate gtc ata gat get 240 

Val Asp lie Asp Phe Asp Glu Asp His Asn Glu lie Val lie Asp Ala 
65 70 75 

aca gga gac ctg aat tec aat acc cct tat gaa ttt atg age aag atg 288 

Thr Gly Asp Leu Asn Ser Asn Thr Pro Tyr Glu Phe Met Ser Lys Met 
80 85 90 95 

egg get tec ate att gtc atg ggt ccc tta eta gcc cgt aat ggt tat 336 

Arg Ala Ser He He Val Met Gly Pro Leu Leu Ala Arg Asn Gly Tyr 

100 105 110 

gcc aaa gtc get ctt cct ggt ggt tgc gcg att ggg act cgt cct att 384 

Ala Lys Val Ala Leu Pro Gly Gly Cys Ala He Gly Thr Arg Pro He 

115 120 125 

gac ttg cac tta aaa ggc ttc egg get atg ggg gtc gat gtg gaa gtc 432 

Asp Leu His Leu Lys Gly Phe Arg Ala Met Gly Val Asp Val Glu Val 

130 135 140 

gaa gga ggt tat gtg ate gcc aca gtt caa gat gaa ctg gat ggc get 480 

Glu Gly Gly Tyr Val He Ala Thr Val Gin Asp Glu Leu Asp Gly Ala 
145 150 155 

gat att tac ctt gac ttc cca agt gtt gga get aca caa aat att ttg 528 

Asp He Tyr Leu Asp Phe Pro Ser Val Gly Ala Thr Gin Asn lie Leu 
160 165 170 175 

atg get gcc acc egg gca aaa ggg aca aca gtc ate gag aat gca get 57 6 

Met Ala Ala Thr Arg Ala Lys Gly Thr Thr Val He Glu Asn Ala Ala 

180 185 190 

cga gaa cct gaa att gtt gac ctt gcc aac tat ttg aac aag atg ggt 624 

Arg Glu Pro Glu He Val Asp Leu Ala Asn Tyr Leu Asn Lys Met Gly 

195 200 205 

gcc cgt att tac ggg gcc gga acc aat acc atg aga att gaa ggg gta 672 

Ala Arg He Tyr Gly Ala Gly Thr Asn Thr Met Arg He Glu Gly Val 

210 215 220 

gac aag eta gaa get tgt gac cac tec att att gcc gac egg ata gaa 720 

Asp Lys Leu Glu Ala Cys Asp His Ser He He Ala Asp Arg He Glu 
225 230 235 

agt ggc acc ttt atg gta gca get ggt gtc acc caa ggg aat gtc ttg 7 68 

Ser Gly Thr Phe Met Val Ala Ala Gly Val Thr Gin Gly Asn Val Leu 
240 245 250 255 
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att gaa gac tgt ate gtc gaa cac aac cgc ccc tta att tec aag tta 816 
lie Glu Asp Cys lie Val Glu His Asn Arg Pro Leu lie Ser Lys Leu 

260 265 270 

agt gaa atg ggc gtt caa ttt gag gaa gaa aaa acc ggc ctt cga gtc 864 
Ser Glu Met Gly Val Gin Phe Glu Glu Glu Lys Thr Gly Leu Arg Val 

275 280 285 

atg gga cca gag acc tta cag gca aca gat gtt aaa acc ctg cct tat 912 
Met Gly Pro Glu Thr Leu Gin Ala Thr Asp Val Lys Thr Leu Pro Tyr 
290 295 300 

cct ggc ttc cca act gat atg cag tea ccg atg aca gtc gec caa acc 960 
Pro Gly Phe Pro Thr Asp Met Gin Ser Pro Met Thr Val Ala Gin Thr 
305 310 315 

eta get gag gga aga age ate atg aga gaa acg gtc ttc gaa aac cgc 1008 
Leu Ala Glu Gly Arg Ser lie Met Arg Glu Thr Val Phe Glu Asn Arg 
320 325 330 335 

ttc atg cac atg gaa gag ctt cgt aaa atg gat gca caa ttt act gtc 1056 
Phe Met His Met Glu Glu Leu Arg Lys Met Asp Ala Gin Phe Thr Val 

340 345 350 

gat ggc cag tec ctt att ate gag ggg ggc aaa aaa etc caa ggt get 1104 
Asp Gly Gin Ser Leu lie lie Glu Gly Gly Lys Lys Leu Gin Gly Ala 

355 360 365 

aga gtc cag tec agt gac ttg egg get tea get tec ttg att att get 1152 
Arg Val Gin Ser Ser Asp Leu Arg Ala Ser Ala Ser Leu lie lie Ala 
370 375 380 

ggt tta gta get gat ggt gtc acc aaa gta acc aat ctt aac cac tta 12 00 

Gly Leu Val Ala Asp Gly Val Thr Lys Val Thr Asn Leu Asn His Leu 
385 390 395 

gac egg ggc tac tat aaa ttt cac gaa aaa tta cag caa tta ggt get 1248 
Asp Arg Gly Tyr Tyr Lys Phe His Glu Lys Leu Gin Gin Leu Gly Ala 
400 405 410 415 

tec att gaa cga ate gac gag gaa att caa gtt gac cag gaa gec age 129 6 

Ser lie Glu Arg lie Asp Glu Glu He Gin Val Asp Gin Glu Ala Ser 

420 425 430 

etc aaa aaa ggc gaa taa 1314 
Leu Lys Lys Gly Glu 

435 



<210> 102 
<211> 436 
<212> PRT 

<213> Alloiococcus otitidis 



<400> 102 

Met Asp Thr He Val He Gin Gly Gly Asp Asn Arg Leu Glu Gly Thr 
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10 15 



Val Lys Val Glu Gly Ala Lys Asn Ala Ala Leu Pro He Leu Ala Ala 

20 25 30 



Ser Leu Leu Pro Glu Asp Gly Lys Ser His Leu Ser Asn Val Pro Leu 
35 40 45 



Leu Ser Asp He Tyr Thr Met Gin Glu Val Leu Arg Tyr Leu Asn Val 
50 55 60 



Asp He Asp Phe Asp Glu Asp His Asn Glu He Val He Asp Ala Thr 
65 70 75 80 



Gly Asp Leu Asn Ser Asn Thr Pro Tyr Glu Phe Met Ser Lys Met Arg 

85 90 95 



Ala Ser He He Val Met Gly Pro Leu Leu Ala Arg Asn Gly Tyr Ala 

100 105 110 



Lys Val Ala Leu Pro Gly Gly Cys Ala He Gly Thr Arg Pro He Asp 
115 120 125 



Leu His Leu Lys Gly Phe Arg Ala Met Gly Val Asp Val Glu Val Glu 
130 135 140 



Gly Gly Tyr Val He Ala Thr Val Gin Asp Glu Leu Asp Gly Ala Asp 
145 150 155 160 



He Tyr Leu Asp Phe Pro Ser Val Gly Ala Thr Gin Asn He Leu Met 

165 170 175 



Ala Ala Thr Arg Ala Lys Gly Thr Thr Val He Glu Asn Ala Ala Arg 

180 185 190 



Glu Pro Glu He Val Asp Leu Ala Asn Tyr Leu Asn Lys Met Gly Ala 
195 200 205 



Arg He Tyr Gly Ala Gly Thr Asn Thr Met Arg He Glu Gly Val Asp 
210 215 220 



Lys Leu Glu Ala Cys Asp His Ser Xle He Ala Asp Arg He Glu Ser 
225 230 235 240 
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Gly Thr Phe Met Val Ala Ala Gly Val Thr Gin Gly Asn Val Leu He 

245 250 255 



Glu Asp Cys He Val Glu His Asn Arg Pro Leu He Ser Lys Leu Ser 

260 265 270 



Glu Met Gly Val Gin Phe Glu Glu Glu Lys Thr Gly Leu Arg Val Met 
275 280 285 



Gly Pro Glu Thr Leu Gin Ala Thr Asp Val Lys Thr Leu Pro Tyr Pro 
290 295 300 



Gly Phe Pro Thr Asp Met Gin Ser Pro Met Thr Val Ala Gin Thr Leu" 
305 310 . 315 . 320 



Ala Glu Gly Arg Ser lie Met Arg Glu Thr Val Phe Glu Asn Arg Phe 

325 330 335 



Met His Met Glu Glu Leu Arg Lys Met Asp Ala Gin Phe Thr Val Asp 

340 345 350 



Gly Gin Ser Leu He He Glu Gly Gly Lys Lys Leu Gin Gly Ala Arg 
355 360 365 



Val Gin Ser Ser Asp Leu Arg Ala Ser Ala Ser Leu He He Ala Gly 
370 375 380 



Leu Val Ala Asp Gly Val Thr Lys Val Thr Asn Leu Asn His Leu Asp 
385 390 395 400 



Arg Gly Tyr Tyr Lys Phe His Glu Lys Leu Gin Gin Leu Gly Ala Ser 

405 410 415 



He Glu Arg He Asp Glu Glu He Gin Val Asp Gin Glu Ala Ser Leu 

420 425 430 



Lys Lys Gly Glu 
435 



<210> 103 
<211> 1026 
<212> DNA 
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<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (34) . . (1026) 

<223> 

<400> 103 

acagttttaa tccagttagc cacaaggtgg gat atg atg gac tta gca gaa aaa 

Met Met Asp Leu Ala Glu Lys 
1 5 

caa gca ggg gtc tac caa ctt ttt gac cga ate ctg gec aac cat gec 
Gin Ala Gly Val Tyr Gin Leu Phe Asp Arg lie Leu Ala Asn His Ala 
10 15 20 

etc aag cat gec tat ctt ttt gaa ggt ttg gec gga tea ggc aaa ctg 
Leu Lys His Ala Tyr Leu Phe Glu Gly Leu Ala Gly Ser Gly Lys Leu 
25 30 35 

gag atg age egg tat att gec aag aga ctg ttt tgc ccc aac caa gac 
Glu Met Ser Arg Tyr lie Ala Lys Arg Leu Phe Cys Pro Asn Gin Asp 
40 45 50 55 

cag gga caa get tgc caa gtt tgt ccc act tgc ttg cgc att gac cag 
Gin Gly Gin Ala Cys Gin Val Cys Pro Thr Cys Leu Arg lie Asp Gin 

60 65 70 

ggt caa cac cct gat gtg gta gaa ata gee cct gag ggg aag gga egg 
Gly Gin His Pro Asp Val Val Glu lie Ala Pro Glu Gly Lys Gly Arg 

75 80 85 

teg att agg gta gac egg gta cga cag gtc aag gat gee eta age aag 
Ser lie Arg Val Asp Arg Val Arg Gin Val Lys Asp Ala Leu Ser Lys 
90 95 100 

tct ggt gtg gag agt caa aag aaa atg att ate ctt aac cag get gat 
Ser Gly Val Glu Ser Gin Lys Lys Met lie lie Leu Asn Gin Ala Asp 
105 110 115 

aaa atg acc ccc agt gca gee aac age ctg ctt aaa ttt ctg gaa gag 
Lys Met Thr Pro Ser Ala Ala Asn Ser Leu Leu Lys Phe Leu Glu Glu 
120 125 130 135 

ccg gca ggg gat gtg act att ttc ttg tta gtt act age egg caa aac 
Pro Ala Gly Asp Val Thr lie Phe Leu Leu Val Thr Ser Arg Gin Asn 

140 145 150 

ctt ttg cca act att gtt tec cgc tgc cag gtt ate cag ttt gec aag 
Leu Leu Pro Thr He Val Ser Arg Cys Gin Val He Gin Phe Ala Lys 

155 160 165 

cag gat tta aag act egg att gag gac tta gtg gaa gec ggt ttg tec 
Gin Asp Leu Lys Thr Arg He Glu Asp Leu Val Glu Ala Gly Leu Ser 
170 175 180 



cag gaa gaa gec cac ttg gec age cac etc age caa gac tta gac ttg 
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Gin Glu Glu Ala His Leu Ala Ser His Leu Ser Gin Asp Leu Asp Leu 
185 190 195 

get aag tec etc att gag gaa gag gac ttg ctg gca gtc agt caa aaa 678 
Ala Lys Ser Leu lie Glu Glu Glu Asp Leu Leu Ala Val Ser Gin Lys 
200 205 210 215 

att tgg cag tgg ttt age tat etc atg aac caa gat gac ttg gec ttt 726 
lie Trp Gin Trp Phe Ser Tyr Leu Met Asn Gin Asp Asp Leu Ala Phe 

220 225 230 

ate eta gtc caa aga gac tta atg gec ttt ate caa gac egg gat gac 774 
lie Leu Val Gin Arg Asp Leu Met Ala Phe lie Gin Asp Arg Asp Asp 

235 240 245 

tgc cag atg gtt tgt gac tta ate etc tac etc ttc caa gac ctg etc 822 
Cys Gin Met Val Cys Asp Leu lie Leu Tyr Leu Phe Gin Asp Leu Leu 
250 255 260 

cac tta cac tac cat tta gat agt ccg gec tgc ttc gca ggc cac gaa 870 
His Leu His Tyr His Leu Asp Ser Pro Ala Cys Phe Ala Gly His Glu 
265 270 275 

agt gac etc cgc tac ttt atg gac ctg ctt teg ate aag caa gtg tct 918 
Ser Asp Leu Arg Tyr Phe Met Asp Leu Leu Ser lie Lys Gin Val Ser 
280 285 290 295 

tat gec atg caa gec ace ctg caa get aaa aga gaa gtg gac cac aat 966 
Tyr Ala Met Gin Ala Thr Leu Gin Ala Lys Arg Glu Val Asp His Asn 

300 305 310 

gtg gee agt cag get gtt tta gaa ggc ttg act ttg gac ttg cag gaa 1014 
Val Ala Ser Gin Ala Val Leu Glu Gly Leu Thr Leu Asp Leu Gin Glu 

315 320 325 

agt ata ggc taa 1026 
Ser He Gly 
330 



<210> 104 
<211> 330 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 104 

Met Met Asp Leu Ala Glu Lys Gin Ala Gly Val Tyr Gin Leu Phe Asp 
15 10 15 



Arg He Leu Ala Asn His Ala Leu Lys His Ala Tyr Leu Phe Glu Gly 

20 25 30 



Leu Ala Gly Ser Gly Lys Leu Glu Met Ser Arg Tyr He Ala Lys Arg 
35 40 45 
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Leu Phe Cys Pro Asn Gin Asp Gin Gly Gin Ala Cys Gin Val Cys Pro 
50 55 60 



Thr Cys Leu Arg lie Asp Gin Gly Gin His Pro Asp Val Val Glu lie 
65 70 75 80 



Ala Pro Glu Gly Lys Gly Arg Ser lie Arg Val Asp Arg Val Arg Gin 

85 90 95 



Val Lys Asp Ala Leu Ser Lys Ser Gly Val Glu Ser Gin Lys Lys Met 

100 105 110 



lie He Leu Asn Gin Ala Asp Lys Met Thr Pro Ser Ala Ala Asn Ser 
115 120 125 



Leu Leu Lys Phe Leu Glu Glu Pro Ala Gly Asp Val Thr He Phe Leu 
130 135 140 



Leu Val Thr Ser Arg Gin Asn Leu Leu Pro Thr He Val Ser Arg Cys 
145 150 155 160 



Gin Val He Gin Phe Ala Lys Gin Asp Leu Lys Thr Arg He Glu Asp 

165 170 175 



Leu Val Glu Ala Gly Leu Ser Gin Glu Glu Ala His Leu Ala Ser His 

180 185 190 



Leu Ser Gin Asp Leu Asp Leu Ala Lys Ser Leu He Glu Glu Glu Asp 
195 200 205 



Leu Leu Ala Val Ser Gin Lys He Trp Gin Trp Phe Ser Tyr Leu Met 
210 215 220 



Asn Gin Asp Asp Leu Ala Phe He Leu Val Gin Arg Asp Leu Met Ala 
225 230 235 240 



Phe lie Gin Asp Arg Asp Asp Cys Gin Met Val Cys Asp Leu lie Leu 

245 250 255 



Tyr Leu Phe Gin Asp Leu Leu His Leu His Tyr His Leu Asp Ser Pro 

260 265 270 
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Ala Cys Phe Ala Gly His Glu Ser Asp Leu Arg Tyr Phe Met Asp Leu 
275 280 285 



Leu Ser lie Lys Gin Val Ser Tyr Ala Met Gin Ala Thr Leu Gin Ala 
290 295 300 



Lys Arg Glu Val Asp His Asn Val Ala Ser Gin Ala Val Leu Glu Gly 
305 310 315 320 



Leu Thr Leu Asp Leu Gin Glu Ser lie Gly 

325 330 



<210> 105 
<211> 1785 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 
<221> CDS 

<222> (13) . . (1785) 
<223> 

<400> 105 

gaggggagag ct atg acc cac cag gcc tta tac egg gta tgg cga ccg caa 51 

Met Thr His Gin Ala Leu Tyr Arg Val Trp Arg Pro Gin 
15 10 

agt ttt get gat gta tec ggc cag cat gtg gtc acc aag acc eta aag 99 
Ser Phe Ala Asp Val Ser Gly Gin His Val Val Thr Lys Thr Leu Lys 
15 20 25 

aat gcc att aaa aat gat aat acc agt cat gcc tac ctg ttt act gga 147 
Asn Ala lie Lys Asn Asp Asn Thr Ser His Ala Tyr Leu Phe Thr Gly 
30 35 40 45 

ccc egg ggg acg ggc aag acc agt gtg gca aaa ata ttt gcc aag gcc 195 
Pro Arg Gly Thr Gly Lys Thr Ser Val Ala Lys lie Phe Ala Lys Ala 

50 55 60 

att aat tgc ccc tac teg gat gat ggg gag cct tgt aat gaa tgt cag 243 
lie Asn Cys Pro Tyr Ser Asp Asp Gly Glu Pro Cys Asn Glu Cys Gin 

65 70 75 

att tgc cag gag ate acc cag ggt agt eta ggc gat gtc ate gaa ate 291 
lie Cys Gin Glu He Thr Gin Gly Ser Leu Gly Asp Val He Glu He 
80 85 90 

gat gcg gcc age aat aat ggg gtg gaa gag att cgc gat att agg gaa 339 
Asp Ala Ala Ser Asn Asn Gly Val Glu Glu He Arg Asp He Arg Glu 
95 100 105 



aag get aat tat gcc cca act teg gcc gtt tac aag gtc tac att ate 387 
Lys Ala Asn Tyr Ala Pro Thr Ser Ala Val Tyr Lys Val Tyr He He 
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110 115 120 125 

gat gag gtc cat atg tta tec tct ggg gec ttt aac gec etc ttg aaa 435 
Asp Glu Val His Met Leu Ser Ser Gly Ala Phe Asn Ala Leu Leu Lys 

130 135 140 

aca ctg gaa gag cct cca gec aat gtg gtc ttt ate tta gca acg act 483 
Thr Leu Glu Glu Pro Pro Ala Asn Val Val Phe lie Leu Ala Thr Thx 

145 150 155 

gaa ccc cac aag att ccg get acc att ate tec egg ace cag cgt ttt 531 
Glu Pro His Lys lie Pro Ala Thr lie lie Ser Arg Thr Gin Arg Phe 
160 165 170 

gat ttt aag egg att gac aac cag gac ate ate gac cgc ttg att tat 579 
Asp Phe Lys Arg lie Asp Asn Gin Asp lie lie Asp Arg Leu lie Tyr 
175 180 185 

ate tta gaa gaa gac cag gtc ccc tac age aaa gaa gee gtc eta age 627 
lie Leu Glu Glu Asp Gin Val Pro Tyr Ser Lys Glu Ala Val Leu Ser 
190 195 200 205 

eta gee aat gca gcg gaa ggt ggg atg egg gat gee ttg agt atg ttg 675 
Leu Ala Asn Ala Ala Glu Gly Gly Met Arg Asp Ala Leu Ser Met Leu 

210 215 220 

gac cag gee tta age ttt atg aca gat gag tta aca gaa gaa gtt gec 723 
Asp Gin Ala Leu Ser Phe Met Thr Asp Glu Leu Thr Glu Glu Val Ala 

225 230 235 

etc cag att aca ggg age att acc cag tct etc ttg ctt gaa tac ttg 771 
Leu Gin lie Thr Gly Ser lie Thr Gin -Ser Leu Leu Leu Glu Tyr Leu 
240 245 250 

cag gtg att age caa ggt cag acg gaa gaa gga etc aag etc ttg caa 819 
Gin Val lie Ser Gin Gly Gin Thr Glu Glu Gly Leu Lys Leu Leu Gin 
255 260 265 

gaa gtt tta ggg gaa ggc aag gac cct age egg ttt gtg gaa gac get 867 
Glu Val Leu Gly Glu Gly Lys Asp Pro Ser Arg Phe Val Glu Asp Ala 
270 275 280 285 

att atg atg acc egg gac etc ttg ctt tac caa act age caa ggc gat 915 
He Met Met Thr Arg Asp Leu Leu Leu Tyr Gin Thr Ser Gin Gly Asp 

290 295 300 

aat ttt gtt cct aaa ttg get cgc tta gac gac cag ttt gaa gac ctg 963 
Asn Phe Val Pro Lys Leu Ala Arg Leu Asp Asp Gin Phe Glu Asp Leu 

305 310 315 

gcg aag gac ttg gac aag gag atg gec tac cat att att gat gtc tta 1011 
Ala Lys Asp Leu Asp Lys Glu Met Ala Tyr His He He Asp Val Leu 
320 325 330 

aac caa acc caa gac gat etc cgc eta age aac cat ggg gaa gtc tat 1059 
Asn Gin Thr Gin Asp Asp Leu Arg Leu Ser Asn His Gly Glu Val Tyr 
335 340 345 
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ttg gaa ata gcc acg gtc aag ctt age cag cct tct tea gec gtt cag 
Leu Glu lie Ala Thr Val Lys Leu Ser Gin Pro Ser Ser Ala Val Gin 
350 355 360 365 



1107 



acc ate cag gcc age caa gtc aac atg gtg gac cag gat aat aaa gaa 1155 
Thr He Gin Ala Ser Gin Val Asn Met Val Asp Gin Asp Asn Lys Glu 

370 375 380 



gag att gcc caa ctg caa aac cag gtc aag tec etc cag caa agt att 
Glu He Ala Gin Leu Gin Asn Gin Val Lys Ser Leu Gin Gin Ser He 

385 390 395 

caa aac ttg caa get gga gcc aaa caa ggg cct aag caa aga get aag 
Gin Asn Leu Gin Ala Gly Ala Lys Gin Gly Pro Lys Gin Arg Ala Lys 
400 405 410 

tea aaa get ggc ccc aag caa tct ggc cct ggc aag tct aga age cac 
Ser Lys Ala Gly Pro Lys Gin Ser Gly Pro Gly Lys Ser Arg Ser His 
415 420 425 

cgt cac cag caa ggc ttc aag gtt aac egg aaa gcc gtt tac tct ate 
Arg His Gin Gin Gly Phe Lys Val Asn Arg Lys Ala Val Tyr Ser He 
430 435 440 445 



cca gac ttg ate aat gtc ttg acc ate agt caa aag get ate tta aac 
Pro Asp Leu He Asn Val Leu Thr He Ser Gin Lys Ala He Leu Asn 

465 470 475 

aat tec aaa cca gtt get get agt cca gag ggt ttg gtg gtg acc ttt 
Asn Ser Lys Pro Val Ala Ala Ser Pro Glu Gly Leu Val Val Thr Phe 
480 485 490 

gaa tat gat att eta tgt gag aga gca gag tct gac gag acc ttg caa 
Glu Tyr Asp He Leu Cys Glu "Arg Ala Glu Ser Asp Glu Thr Leu Gin 
495 500 505 

acg get ate ggc aat tac ate gaa aaa att ate ggc cgc cgt cca aga 
Thr Ala He Gly Asn Tyr He Glu Lys He He Gly Arg Arg Pro Arg 
510 515 520 525 

ctg gtc tgt gtg cct gaa gac aag tgg ccg act ate cgc cgc gat ttt 
Leu Val Cys Val Pro Glu Asp Lys Trp Pro Thr He Arg Arg Asp Phe 

530 535 540 

ate aag cag atg aaa aaa gaa gat ggc agt act aaa get ggc caa gca 
lie Lys Gin Met Lys Lys Glu Asp Gly Ser Thr Lys Ala Gly Gin Ala 

545 550 555 



1203 



1251 



1299 



1347 



ttg gac cag gcg acc cgt aaa gac ctg gac gac etc caa gac etc tgg 1395 
Leu Asp Gin Ala Thr Arg Lys Asp Leu Asp Asp Leu Gin Asp Leu Trp 

450 455 460 



1443 



1491 



1539 



1587 



1635 



1683 



agt gac ggc aag teg gat gat gac cca ggt caa gaa gac aac cag gcc 1731 
Ser Asp Gly Lys Ser Asp Asp Asp Pro Gly Gin Glu Asp Asn Gin Ala 
560 565 570 
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ctt aac aag get gtg gag ctt ttc ggt aaa gac aat att aca ate aaa 1779 
Leu Asn Lys Ala Val Glu Leu Phe Gly Lys Asp Asn lie Thr lie Lys 
575 580 585 

gat taa 1785 

Asp 

590 



<210> 106 
<211> 590 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 106 

Met Thr His Gin Ala Leu Tyr Arg Val Trp Arg Pro Gin Ser Phe Ala 
1 5 10 15 



Asp Val Ser Gly Gin His Val Val Thr Lys Thr Leu Lys Asn Ala lie 

20 25 30 



Lys Asn Asp Asn Thr Ser His Ala Tyr Leu Phe Thr Gly Pro Arg Gly 
35 40 45 



Thr Gly Lys Thr Ser Val Ala Lys He Phe Ala Lys Ala He Asn Cys 
50 55 60 



Pro Tyr Ser Asp Asp Gly Glu Pro Cys Asn Glu Cys Gin He Cys Gin 
65 70 75 80 



Glu He Thr Gin Gly Ser Leu Gly Asp Val He Glu He Asp Ala Ala 

85 90 95 



Ser Asn Asn Gly Val Glu Glu He Arg Asp He Arg Glu Lys Ala Asn 

100 105 110 



Tyr Ala Pro Thr Ser Ala Val Tyr Lys Val. Tyr He He Asp Glu Val 
115 120 125 



His Met Leu Ser Ser Gly Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu 
130 135 140 



Glu Pro Pro Ala Asn Val Val Phe He Leu Ala Thr Thr Glu Pro His 
145 150 155 160 



Lys Xle Pro Ala Thr He He Ser Arg Thr Gin Arg Phe Asp Phe Lys 

165 170 175 
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Arg lie Asp Asn Gin Asp lie lie Asp Arg Leu lie Tyr lie Leu Glu 

180 185 190 



Glu Asp Gin Val Pro Tyr Ser Lys Glu Ala Val Leu Ser Leu Ala Asn 
195 200 205 



Ala Ala Glu Gly Gly Met Arg Asp Ala Leu Ser Met Leu Asp Gin Ala 
210 215 220 



Leu Ser Phe Met Thr Asp Glu Leu Thr Glu Glu Val Ala Leu Gin He 
225 230 235 240 



Thr Gly Ser lie Thr Gin Ser Leu Leu Leu Glu Tyr Leu Gin Val He 

245 250 255 



Ser Gin Gly Gin Thr Glu Glu Gly Leu Lys Leu Leu Gin Glu Val Leu 

260 265 270 



Gly Glu Gly Lys Asp Pro Ser Arg Phe Val Glu Asp Ala He Met Met 
275 280 285 



Thr Arg Asp Leu Leu Leu Tyr Gin Thr Ser Gin Gly Asp Asn Phe Val 
290 295 300 



Pro Lys Leu Ala Arg Leu Asp Asp Gin Phe Glu Asp Leu Ala Lys Asp 
305 310 315 320 



Leu Asp Lys Glu Met Ala Tyr His He He Asp Val Leu Asn Gin Thr 

325 330 335 



Gin Asp Asp Leu Arg Leu Ser Asn His Gly Glu Val Tyr Leu Glu He 

340 345 350 



Ala Thr Val Lys Leu Ser Gin Pro Ser Ser Ala Val Gin Thr He Gin 
355 360 365 



Ala Ser Gin Val Asn Met Val Asp Gin Asp Asn Lys Glu Glu He Ala 
370 375 380 



Gin Leu Gin Asn Gin Val Lys Ser Leu Gin Gin Ser He Gin Asn Leu 
385 390 395 400 
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Gin Ala Gly Ala Lys Gin Gly Pro Lys Gin Arg Ala Lys Ser Lys Ala 

405 410 415 

Gly Pro Lys Gin Ser Gly Pro Gly Lys Ser Arg Ser His Arg His Gin 

420 425 430 

Gin Gly Phe Lys Val Asn Arg Lys Ala Val Tyr Ser lie Leu Asp Gin 
435 440 445 

Ala Thr Arg Lys Asp Leu Asp Asp Leu Gin Asp Leu Trp Pro Asp Leu 
450 455 460 

He Asn Val Leu Thr He Ser Gin Lys Ala He Leu Asn Asn Ser Lys 
46 5 470 475 480 

Pro Val Ala Ala Ser Pro Glu Gly Leu Val Val Thr Phe Glu Tyr Asp 

485 490 495 

He Leu Cys Glu Arg Ala Glu Ser Asp Glu Thr Leu Gin Thr Ala He 

500 505 510 

Gly Asn Tyr He Glu Lys He He Gly Arg Arg Pro Arg Leu Val Cys 
515 520 525 

Val Pro Glu Asp Lys Trp Pro Thr He Arg Arg Asp Phe He Lys Gin 
530 535 540 

Met Lys Lys Glu Asp Gly Ser Thr Lys Ala Gly Gin Ala Ser Asp Gly 
545 550 555 560 

Lys Ser Asp Asp Asp Pro Gly Gin Glu Asp Asn Gin Ala Leu Asn Lys 

565 570 575 



Ala Val Glu Leu Phe Gly Lys Asp Asn He Thr He Lys Asp 

580 585 590 



